Jonathan,
> Thanks ... saves me doing it. I may have a quick play with the
> numbers now ...
One quick play in my current environment (just using Ved to answer mail,
but with a lot of local-stuff precompiled):
lvars lll = dic_numbers();
lvars vec = initshortvec(25);
lvars x, total = 0, maxnum = 0;
for x in lll do
x + total -> total;
max(maxnum,x) -> maxnum;
1 + vec(x+1) -> vec(x+1);
endfor;
;;; average bucket size
total/length(lll)+0.0 =>
** 7.88367
;;; largest size
maxnum =>
** 24
;;; distribution of sizes
vec =>
** <shortvec 0 10 23 31 62 112 115 131 149 104 87 69 50 32 22 12 8 2 3 0 0 0 0 0 1>
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15....
[Bucket size]
Is that a poisson distribution? I don't recall the definition.
Certainly it is skewed to the left, as it should be.
Number of words
countwords()=>
** 8106
>
> >> IIRC, it was
> >> very simple: some combination of first, second and last character
> >> with the length of the word, or was that long ago and far away?
> >
> >Your memory is almost right. here's the code in syspop11 in
> [snip]
>
> Yes, you were too quick! I've just rebooted back into Windows
> after having a look at it. (I must try out some Linux
> newsreaders ...)
I use the Bham versions of ved_gn, ved_postnews, etc., which use a
socket connection to the site in the environment variable NNTPSERVER
You can get the package here:
http://www.cs.bham.ac.uk/research/poplog/vedgn.tar.gz
55568 bytes Jun 30 2002
There's a summary in HELP ved_gn
http://www.cs.bham.ac.uk/research/poplog/help/ved_gn
If you are going to do that and you want to email some posters,
use the up to date ved mail package.
http://www.cs.bham.ac.uk/research/poplog/vedmail.tar.gz
90413 bytes Nov 22 17:04 2003
It re-defines ved_send(mr) to use sendmail so you may have to do some
tweaking of sendmail stuff in the /etc/ directory.
I should change it one day to allow alternative mailers. It used
to pipe through 'mail' but that turned out to be too limiting.
I forget why.
Some of the details are here:
http://www.cs.bham.ac.uk/research/poplog/help/ved_getmail
Alternatively if you are now a wysywig person, install mozilla (not
netscape), from www.mozilla.org
It is free, easy to install, fast, robust, has improved beyond
recognition over the last two years and is getting better all the time.
More and more web sites now comply with standards so that they work with
mozilla (e.g. even www.pcworld.co.uk and www.philips.co.uk after many
complaints, though there are still some stupid banks, etc. that don't).
You can also install it on windows, and have the same environment.
You could set one of them not to delete from the mail server when
reading.
One day it should be possible to browse teach files in something like
mozilla and tell the browser to pass examples to the compiler.
There are programmer interfaces, but I've not looked at them, and
do not plan to.
(Firebird, a smaller, slightly faster, derivative of mozilla, without
mail, news or html composer, has many progrmmed extensions available.)
Aaron
|