r3wp [groups: 83 posts: 189283]
  • Home
  • Script library
  • AltME Archive
  • Mailing list
  • Articles Index
  • Site search
 

World: r3wp

[!REBOL3 Proposals] For discussion of feature proposals

Maxim
13-Jan-2011
[579x5]
we still need to add an extra if before the length? all the time.


its funny because the recent discussion on naming and *? words has 
made me realize that the length? and many others could in fact be 
altered to support this idiom if its done consistently.
we have two questions answered in one function.  using if/either/unless 
allows to filter based on capacity and the result allows us to act 
on the actual value.
another example, but this time applied to the index? function.
		
get-item: func [data][
	either i: index? data [
		pick data i
	][data]
]


note that this can be done differently, but I think this is the easiest 
to read because its almost like an algorithm.


i.e. "if this has an index, get its item at that index, otherwise 
just return the value."
it also prevents us from having to create sets of predefined filters 
which will:

    -be unknown and forgotten quickly (e.g. more things to remember, 
    bad)

    -might only really be usable by a very few functions to begin with. 
    (not very usefull)


(this relates to the alternative that ladislav pointed, using a dataset)
changing many of these functions would make them satisfy the *proposed* 
style convention that *? functions should be truthy, if possible. 
  


in many cases, this is the case, because there is no "possible" none 
return value and errors are returned instead.
Oldes
13-Jan-2011
[584]
we still need to add an extra if before the length? all the time.

 ... actually you would just move the IF to the native part and you 
 will use the IF also in cases where you are pretty sure it's a series 
 and want just its length.
Maxim
13-Jan-2011
[585]
not sure I undestand what you mean   ' :-/
Oldes
13-Jan-2011
[586x2]
never mind.. I'm wrong.. the condition must be in the native anyway 
so it's fine just to return none.
sorry.. I'm babysitting here.. should first think some time before 
I write.
Maxim
13-Jan-2011
[588x2]
hehe... nah just say the kid/baby took over the keyboard  ;-)
unless its a cat!
Gregg
13-Jan-2011
[590]
On one hand, I like the idea that a none LENGTH? means "this value 
has no length", on the other the idea of a pickable! typset (maybe 
a different name though) gives us the ability to use it in func specs 
and such.

I'm not ready to say change LENGTH? without further thought.
Ladislav
13-Jan-2011
[591]
Max, your plan is unrealistic
Maxim
13-Jan-2011
[592x2]
note, that this proposal is strictly related to functions which have 
an '? at the end.  


if we go ahead with changes to the style convention which clearly 
define what a "truthy" function is and state that ****? is generally 
used to define such a function, I think that we could clean up a 
lot of the guess work in how to handle this often recurring case.
in what sense is it unrealistic?
Ladislav
13-Jan-2011
[594x2]
e.g.:

a: make object! [a1: 1]
length? a
pick a 1
, etc....
Maxim
13-Jan-2011
[596]
well, if an object has a length? why can't I pick it?  that is the 
error IMHO.  though this is not the debate I want to start ;-)
Ladislav
13-Jan-2011
[597x2]
You already did
There are many such cases you don't like
Maxim
13-Jan-2011
[599]
obviously not all cases will fit perfectly.  and the above was just 
one (simple) example, where you can get 95% of the work done in one 
line and then tailor the few special cases after.
Sunanda
13-Jan-2011
[600]
Maxim <If an object has length, why can't I pick it?>
Maybe they are like SQL tables.....
A table has a length, so this is valid

   select count(*) from ....   -- to get the length of the table (ie 
   number of rows)

But, in the absence of an ORDER BY the rows do not have a user-accessible 
sequence, so

  select first from ....   -- is not valid syntax, nor (without an 
  ORDER BY) is it even meaningful

(Sorry, I know you did not want that debate)
Maxim
13-Jan-2011
[601]
btw the fact that pick can't use objects isn't related in any way 
to what length? can or cannot do.


I just gave an example to *illustrate* how clean it is to use thruty 
returns in real code.

whatever the algorithm or functions we use we will always have to 
handle the special cases (like 0 in maths)
Ladislav
13-Jan-2011
[602x2]
The fact, that the code actually cannot be used means, that your 
idea is not supported by a real usage example, and is not completely 
thought-out.
Taking into account, that the disadvantages of the change are real, 
while the advantages are just virtual, it is hard for me to support 
it.
Maxim
13-Jan-2011
[604x2]
Lad you are well placed to know that iterating over ANY random data 
list in ERBOL will require special cases in EVERY single algorythm 
you can think of.  every datatype has its own idiosyncracies.  


in a way, this is the point of datatypes in REBOL. They aren't generic. 
 so I don't expect them to behave so in the first place.
give me your perceived disadvantages.  I've been pulling my own weight, 
but I'm not seeing alot of points related to LENGTH? and other such 
funcs...  I'm not talking about a specific function chain.
Ladislav
13-Jan-2011
[606]
Generally, it looks, that REBOL is headed towards "less error triggering" 
approach, and I don't feel any of the changes as a problem, so, you 
may be right. But, there should be some limits, where to stop. One 
of my limits is the usage of NaNs in REBOL, I prefer clean errors 
triggered to NaN dissemination. So, in that case, I know where my 
preferred boundary is. In case of LENGTH? and INDEX? function, I 
am not totally sure, but I am sure, that any change to the existing 
state is expensive (postponing the release, taking resources needed 
to solve more important problems, ...) So, I would support such a 
change only when I would get really convincing arguments.
Maxim
13-Jan-2011
[607x3]
yes, in such a sense (expensive change), I understand.  


I will wait for brian to step up and get his opinion.  I know we've 
spoken about this somewhat before, but not as head-on, for myself... 
 With the recent talk about naming, IMHO it has put a new light on 
the possibility for this to be a bit more appreciated, adding ? at 
the end would now mean something concrete.


let it be known that I do like errors, I am not against errors, I 
have been putting less and less error recovery in my code to make 
sure it crashes and I fix bugs early.


its just that I can see many if not most ****? functions (my own 
included) being truthy and this is very usefull to facilitate  control 
flow.  This, as oposed to always putting error handlers in places 
where the error is only used as a negative reply, for which we can 
supply, IMHO, a reasonable value/standard.
another proposal:


have info? support more datatypes.  it would be very nice for a single 
function to return information about any datatype in a single call.

ex:
>> a: make string! 1000
>> a: next append a "12345678"

== make object! [length: 7 index: 2 size: 8 buffer: 1000 references: 
1]
the references would tell us how massively this string is being used 
within the application and allow us to track if we are leaking data 
in a way that it will never be able to GC it.
Ladislav
13-Jan-2011
[610x3]
That "references:" may be expensive, since I doubt there is code 
doing that now.
(very expensive, I dare to say)
...and, the GC is able to collect a string no matter how many references 
refer to it
Maxim
13-Jan-2011
[613]
on objects info? could just return all the various reflective properties 
it has (words-of spec-of) and a few more like ram being used by object. 
   all types would benefit from this sorely lacking feature in REBOL.
Ladislav
13-Jan-2011
[614]
(REBOL GC does not use reference counting)
Steeve
13-Jan-2011
[615]
really ?
Which algoritm is Rebol using then ?
Ladislav
13-Jan-2011
[616]
I do not know, which one, but about this I am sure
Maxim
13-Jan-2011
[617]
in any case, the idea is to put as much information there as is possible. 
 


this would be VERY usefull for debugging, dialecting, and inspecting 
untrusted coded
Steeve
13-Jan-2011
[618]
I don't like these secrets
Maxim
13-Jan-2011
[619]
one problem with reference counting is cyclical reference.  they 
are hard if even sometimes impossible to collect in some sequences 
of execution.
Ladislav
13-Jan-2011
[620]
(generally, reference counting is not a true garbage collection)
Steeve
13-Jan-2011
[621]
cyclical references is a problem with any sort of algoritm.
Maxim
13-Jan-2011
[622]
yeah but for a garbage collector its a major issue.
Ladislav
13-Jan-2011
[623]
so, the proper GCs don't count references
Maxim
13-Jan-2011
[624]
I do wonder how it knows that all references have been terminated.
Ladislav
13-Jan-2011
[625]
You can understand the whole referencing business as a kind of a 
graph - A refers to B, B refers to C, C refers to A, ...
Maxim
13-Jan-2011
[626x3]
ok, just read a few papers on current GC techniques (including MS's 
pretty competent algorithm)
anyhow... we are a bit off topic from my original post.    I now 
understand why GC is very slow on large datasets.
but its still possible to get that information, maybe it could be 
a refinement, as well as one for MEM stats (if its also heavy to 
resolve)