World: r4wp

[#Red] Red language group

older newer	first last
DocKimbel 16-Nov-2012 [3761]	and I can't afford to have a hole in the list Brian, could you give us some short code cases where this was a problem for you? This would really help.
BrianH 16-Nov-2012 [3762]	pick ser idx + off
DocKimbel 16-Nov-2012 [3763]	Just erroring out on index 0 is ann improvement. That's my intention for Red until we get a consensus on a better overall solution.
Andreas 16-Nov-2012 [3764]	We won't get consensus on this.
DocKimbel 16-Nov-2012 [3765]	We won't if we stick to only R2/R3 options. Fortunately, there are other ones, maybe we'll find more.
BrianH 16-Nov-2012 [3766]	Sorry, I didn't mean off in the sense of false, I meant short for an offset. Any place where you have computed indexes can have a computation that turns out to be less than 1, especially if your base position is offset.
Andreas 16-Nov-2012 [3767x2]	Error out and add PICKZ, POKEZ while at it. That would at least give us a base to work with.
Andreas 16-Nov-2012 [3767x2]	The problem with keeping most of R2's behaviour much longer is that switching to something different will only get harder.
DocKimbel 16-Nov-2012 [3769]	Agreed.
BrianH 16-Nov-2012 [3770]	Remember, R3's behavior isn't done either. There's a standing consistency bug, for instance. If we come to a consensus, R3 is likely to adopt it too.
Andreas 16-Nov-2012 [3771x2]	So if, for example, we were to switch to 0-based indices-as-offsets complemented with an ordinal! type, it's better to do this sooner rather than later.
Andreas 16-Nov-2012 [3771x2]	BrianH: CC for this consistency bug?
BrianH 16-Nov-2012 [3773]	http://issue.cc/r3/609
Andreas 16-Nov-2012 [3774]	Thanks.
BrianH 16-Nov-2012 [3775]	I wouldn't even mind if PICKZ/POKEZ were the actions and PICK/POKE were the native wrapper functions. Not as pretty for port scheme implementors though.
DocKimbel 16-Nov-2012 [3776]	Brian: in `pick ser idx + off`, how often do you need `idx + off` to give you both positive and negative results from the same expression?
BrianH 16-Nov-2012 [3777]	Usually, the number I'm passing to PICK or POKE is computed elsewhere. That was a one-line example for simplicity.
DocKimbel 16-Nov-2012 [3778x3]	I would just like to know if it's an issue (the 0 gap) you hit once in your lifetime or if it's something people encounter from time ot time or even often (depending on the coding style).
	(just curious)
	We should write a short-list of possible options that would solve the whole issue and see if we can get a large consensus on one of them. Anyone kind enough to extract the different options we've discussed and put them somewhere online with the main pros/cons?
BrianH 16-Nov-2012 [3781x3]	I use computed indexes for computed lookup lists, such as for precomputed intermediate results of computations, translation tables, etc. If the computation uses signed numbers, you have to do an offset base position to get the results from the positions less than 1. Having a hole slows down the computation because it has to be handled in mezzanine code. PICKZ/POKEZ would actually be better for most of these situations because the computations work better with 0-based numbers (modulus, for instance). It's pretty common in code that actually needs to use PICK/POKE on series.
	I've found that aside from computed indexes, the only times I use PICK/POKE on series are for negative/0 indexes, or indexes greater than 10. For the rest the ordinal functions are faster.
	Of course port schemes and datatypes actually need to implement PICK/POKE, but at least for datatypes the implementation is usually native so the hole is less expensive to implement.
Andreas 16-Nov-2012 [3784x2]	It's pretty common in code that actually needs to use PICK/POKE on series. That's the sticking point. We can categorise the uses for PICK/POKE: (a) with large positive literals (b) with literals -1, -2 (c) with computed indices and series in "head position" (d) with computed indices and an offset series
Andreas 16-Nov-2012 [3784x2]	More categories welcome.
DocKimbel 16-Nov-2012 [3786]	Having a hole slows down the computation because it has to be handled in mezzanine code. Have you ever had to actually write such "0 gap" workaround in your code, or is Carl the only one that had ever been bitten by this isssue?
Andreas 16-Nov-2012 [3787]	I think it's more common to rewrite the code to something different, where possible, than to try and workaround the 0 gap.
BrianH 16-Nov-2012 [3788x2]	Anyone who does computed indices from offset base positions would run into that issue. However, given that this only tends to happen in heavy-math code, I wouldn't be surprised if Ladislav and I would be caught by it more often than anyone else in the REBOL community, even Carl.
BrianH 16-Nov-2012 [3788x2]	Maybe the other Saphiron guys too.
Andreas 16-Nov-2012 [3790x2]	It will also happen in data-heavy code.
Andreas 16-Nov-2012 [3790x2]	E.g. pure-REBOL database implementations an such.
BrianH 16-Nov-2012 [3792]	I use that pattern in structure-tricky code, like those databases Andreas mentioned.
Andreas 16-Nov-2012 [3793]	I'm pretty sure I ran across this problem when trying to implement a triple store ages ago (and ended up switching to a different implementation approach).
BrianH 16-Nov-2012 [3794x2]	I'm not as much of a math head as Ladislav, so I can't always do the algebraic transformations necessary to switch to unsigned arithmetic.
BrianH 16-Nov-2012 [3794x2]	Strangely enough, Red's native code compiler means that I'm more likely to use it for this than I would REBOL. Mathematical computation isn't REBOL's strong suit.
DocKimbel 16-Nov-2012 [3796]	Brian: that's also the kind of usage where Red should give you the best performances, as math expressions can be highly optimized.
BrianH 16-Nov-2012 [3797]	Yup. But that doesn't mean that I can switch to unsigned arithmetic. For that matter, sometimes switching to unsigned arithmetic means mostly indices at the top and bottom of a large range, skipping the middle (basically what you'd have to do with the Python model). By "large" I mean allocating a 2GB series or more and only using the 100MB or so at both ends. Not always a good idea.
Oldes 16-Nov-2012 [3798]	If you don't want to be hit by the pick 0, you can stay at head position of the series cannot you?
BrianH 16-Nov-2012 [3799x2]	You can stay at the beginning of the series if you add the base offset to every calculated index reference. Bad idea in REBOL, too slow, but maybe Red's optimizer will be smart enough to undo that calculation. How many algebraic transformations will the optimizer be able to handle?
BrianH 16-Nov-2012 [3799x2]	For that matter, in R3 (where I don't get caught by the 0 hole for PICK/POKE, just for AT) I have to do an offset-forward-by-one reference to avoid having to add 1 to every calculated index reference. Doesn't help in R2 though.
DocKimbel 16-Nov-2012 [3801]	Hard to answer that now, but as the optimizing compiler will have a plugin architecture, it won't be limited, in the sense that you'll be able to add your own specific code optimizations, that if generic enough, could make its way into the official repo. If the optimization processing time is short enough, it can be used for the JIT, else, it will be used only in for AOT compilations.
BrianH 16-Nov-2012 [3802x2]	In that case, it would help if PICKZ/POKEZ were the actions because they don't have the 0 hole, and we can add PICK/POKE optimizations that apply an algebraic transformation to change the call to PICKZ/POKEZ. Implementing the hole in the action has overhead that the optimizer wouldn't be able to undo. It would be able to undo the hole implementation in a wrapper function though by inlining the code then optimizing it away. That would make up for the problems caused by putting the 0 hole in there in the first place.
BrianH 16-Nov-2012 [3802x2]	Though a type inference action-to-native transformation would help too :)
DocKimbel 16-Nov-2012 [3804]	For PICKZ/POKEZ, we'll see if they still make sense once we come up with a global solution.
BrianH 16-Nov-2012 [3805x4]	Most computed index situations use modulus, which generates 0-based numbers. If there's no 0 hole in PICK/POKE then you can do PICK/POKE NEXT, as long as this doesn't have side effects the way it does for port schemes or weird copy-on-write stuff; in that case PICKZ/POKEZ would be more of a convenience. If there's a 0 hole in PICK/POKE, you absolutely need a version of the functions without that hole, and having the extra advantage of being 0-based would be even better, an extra justification for their existence.
	Keep in mind though that another occasion when you use computed indices with PICK/POKE is when you do stuff like s/(x) or s/(x): val, since path references for series should be using PICK/POKE internally.
	One occasion where you use computed indexes is when you do C-style fake multidimensional array access. As with C, however, the math tends to be 0-based because of the modulus, so a 0 hole really hurts here. You can do these calculations much more easily if your base series position is 2 (or plus 1 * (dimension - 1) for each dimension) because the 0's go back one.
	(sorry, there's two standard ways to pluralize index and I don't use either consistently)
Jerry 16-Nov-2012 [3809]	Doc, Will you publish a Red/System standard function guide, listing all the functions, such as: print, print-line, length?, size?, zero?, form-type, ...
Gregg 16-Nov-2012 [3810]	- "the most sane way to make a decision here is to come up with use cases": +1 - We are unlikely reach consensus: +1 - Have I ever been bitten by R2's behavior? No. Great discussion on this. I greatly appreciate Andreas's points about common (-1, -2) cases, and his gist example. I'm good with throwing an error where silent incorrect processing would be worse. The concrete examples will be the best way to drive this, IMO. By concrete, I mean real-world, "this is code I actually wrote to solve a problem" stuff. If you can't post code for legal reasons, a generalized adaptation is almost as good. Make the most common uses behave correctly, even if there are cases they don't handle, and show how the other cases can be handled, even if it means slower mezz code. Then see if there's a better way to support them.
older newer	first last