The now even-more-special SPECIALIZE

hostilefork · April 12, 2018, 12:27pm

An early-ish addtion to Ren-C was SPECIALIZE. It creates a variation of a function which has some parameters removed, because they have been set to constant values.

The syntax is that you give it an ACTION! and a block of code. (The action can be itself specialized via GET-PATH!). It will bind the SET-WORD!s in that block to the parameter names of the function you are specializing. Then it will run it:

>> append-abc: specialize :append/only [
    print "block runs arbitrary code!"
    if 1 < 2 [value: [a b c]]
]
block runs arbitrary code!

>> block: copy [x y]

>> append-abc block
== [x y [a b c]]

(Note: One disadvantage of this particular syntax is that it ties you to the names of the parameters. That's not an intrinsic flaw of the specialization mechanic, just this SPECIALIZE native, and other ways of making the generator are possible if people wanted to.)

Not a frivolous feature--a CRITICAL one

Whatever the syntax of specializations, they were very important to add. They're key to reducing the size of the language definition. Because otherwise, you haven't clearly captured the relationships between functions...you wind up with a lot of slightly different code saying the same thing. Each re-implementation is a chance for introducing some weird variation!

For instance: in R3-Alpha, how were FIRST, SECOND, THIRD, ... defined? They were all their own NATIVEs...each one implemented with the C function Do_Ordinal(). As it so happens, the implementation mechanism was very dicey. But it also raises a question for anyone who wants to write ELEVENTH... do you have to crack open the C to do that?

By defining first: specialize :pick [picker: 1], the pattern is much clearer. This mode of cleanup was applied in many places, avoiding writing more C code when the relationship could just be established in Rebol.

But it had to be fast...

For specialization to be palatable, the design had to be done in such a way that it didn't get slower.

Consider in R3-Alpha: what if you wanted a version of APPEND that always had /ONLY, but wanted to expose all the existing refinements and documentation? Let's do it the "easy" way (which makes assumptions that a correct answer would need PARSE and a whole lot of edge case processing to get right)

r3-alpha> spec: spec-of :append
r3-alpha> remove/part (find spec /only) 2
r3-alpha> apo: func spec [
    apply/only :append reduce [
        :series :value :part :length :only :dup :count
    ]
]

r3-alpha> delta-time [block: copy [] loop 100000 [apo block [a]]]
== 0:00:00.114358

r3-alpha> delta-time [block: copy [] loop 100000 [append/only block [a]]]
== 0:00:00.034088

So now you have something that's almost 4x as slow. In Ren-C:

ren-c> apo: specialize 'append [only: true]

ren-c> delta-time [block: copy [] loop 100000 [apo block [a]]]
== 0:00:00.053084

ren-c> delta-time [block: copy [] loop 100000 [append/only block [a]]]
== 0:00:00.047722

It's nice to see Ren-C is indeed within striking distance performance-wise on the plain APPEND/ONLY. (The evaluator is doing an incredible lot more for that price, but I still aim to narrow that gap...as time permits). But that aside, think of the specialization in relative terms...it added only about 12%.

Driving this margin down makes us comfortable defining one function in terms of another, without feeling anxious and thinking it has to be done in C.

Partial Refinements

One thing that the early SPECIALIZE could not do was let you provide refinements without all of their arguments. That is because what it was doing was storing a value in each argument position. And TRUE for a refinement isn't enough to know where the order is.

This blocked an interesting feature, namely being able to specialize functions with GET-PATH!. Consider these situations:

foo: func [/ref1 arg1 /ref2 arg2 /ref3 arg3] [...]

foo23: :foo/ref2/ref3
foo32: :foo/ref3/ref2

foo23 A B ;-- should give A to arg2 and B to arg3
foo32 A B ;-- should give B to arg2 and A to arg3

Merely filling in the slots for the refinements specified with TRUE will not provide enough information for a call to be able to tell the difference between the intents. Also, a call to foo23/ref1 A B C does not want to make arg1 A, because it should act like foo/ref2/ref3/ref1 A B C.

To make a long story short: This works now! And reasonably efficiently.

Binding footnote

The block SPECIALIZE takes is a block of code where SET-WORD! is bound to the arguments of the function. Regular words are not. It really runs whatever code otherwise, so you can put IF statements in it, or whatever you like:

>> value: 10

>> apV: specialize :append [if 1 < 2 [value: value] | print "runs code!"]
runs code!

>> block: copy []

>> apV block
10

The idea that a WORD! and a SET-WORD! have different bindings in a block of code is a bit odd but not without precedent. e.g. in C++, class C {int m; C (int m) : m (m + 1) { ... }} has a member initialization syntax, where the m (m + 1) means "initialize member m with the parameter m plus 1". (It's assumed you aren't interested in initializing the member in terms of itself.)

Whether this is a great idea or not, it has seemed useful so far. Feedback would be useful, as binding remains one of the great bugaboos of the language.

BlackATTR · April 12, 2018, 1:02pm

Binding is one of the great WTFs of the language. Still, SPECIALIZE seems like it offers a lot of benefits. Looking forward to exploring this. Straight-up fiyah!