This is the new home of the egghelp.org community forum.
All data has been migrated (including user logins/passwords) to a new phpBB version.


For more information, see this announcement post. Click the X in the top right-corner of this box to dismiss this message.

Soundex can be used as a drone scorer

Help for those learning Tcl or writing their own scripts.
Post Reply
User avatar
awyeah
Revered One
Posts: 1580
Joined: Mon Apr 26, 2004 2:37 am
Location: Switzerland
Contact:

Soundex can be used as a drone scorer

Post by awyeah »

@Just to let people working on drones know:

I found this algorithm earlier from wikipedia. If this can be implmented into TCL, it can be used as a scorer for drone nicks.
The Soundex code for a name consists of a letter followed by three numbers: the letter is the first letter of the name, and the numbers encode the remaining consonants.

Similar sounding consonants share the same number so, for example, the labial B, F, P and V are all encoded as 1. Vowels can affect the coding, but are never coded directly unless they appear at the start of the name.

The exact algorithm is as follows:

Retain the first letter of the string
Remove all occurrences of the following letters, unless it is the first letter: a, e, h, i, o, u, w, y
Assign numbers to the remaining letters (after the first) as follows:
b, f, p, v = 1
c, g, j, k, q, s, x, z = 2
d, t = 3
l = 4
m, n = 5
r = 6
If two or more letters with the same number were adjacent in the original name (before step 1), or adjacent except for any intervening h and w (American census only), then omit all but the first.

Return the first four characters, right-padding with zeroes if there are fewer than four.
Using this algorithm, both "Robert" and "Rupert" return the same string "R163" while "Rubin" yields "R150".
Source: http://en.wikipedia.org/wiki/Soundex
·­awyeah·

==================================
Facebook: jawad@idsia.ch (Jay Dee)
PS: Guys, I don't accept script helps or requests personally anymore.
==================================
User avatar
Sir_Fz
Revered One
Posts: 3794
Joined: Sun Apr 27, 2003 3:10 pm
Location: Lebanon
Contact:

Post by Sir_Fz »

Soundex Tcl Package. But it won't help with detecting drones.
r
r0t3n
Owner
Posts: 507
Joined: Tue May 31, 2005 6:56 pm
Location: UK

Post by r0t3n »

Maybe if someone makes a system, it could be used to score against drones. Just need to find a way of matching the outcome to being a trojan/drone nickname...?
r0t3n @ #r0t3n @ Quakenet
User avatar
awyeah
Revered One
Posts: 1580
Joined: Mon Apr 26, 2004 2:37 am
Location: Switzerland
Contact:

Post by awyeah »

I searched the forum for soundex and found one match. A post by user in a drone matching topic regarding regexp, where he advised soundex to be used with a dictionary for matching drones. And we all know that user is never wrong. :)
Here's a great idea which i'm about to patent:
Look the nick up in a dictionary. If not found; kick the person telling them to correct the spelling error.

Seriously though...
Have you tried using soundex? I think it would work much better than just relying on the random string not containing certain chars. I suggest using a soundex "dictionary", a list or array of soundexes based on the words from a real dictionary + common names. This dictionary could then be searched to see if the soundex of a nick == some existing word's soundex, which should weed out most random strings.

Then you could have a mechanism for re-entering the room that you tell about in the kick reason, so those not meant to be kicked can rejoin. (a /msg command or what ever)

This is of course just theory and fairly cpu/memory intensive stuff, but IMO worth a try
That's the beauty of my method (I think/hope), because even if they're not real words most nicks SOUND like real words. Except strange nicks like where a number becomes a couple of chars in a word etc...but these could be translated to chars with a 'string map' before the soundex is made (in most cases) and the rest would be allowed to re-enter through the /msg command
Taken from this thread:
http://forum.egghelp.org/viewtopic.php?t=4982
·­awyeah·

==================================
Facebook: jawad@idsia.ch (Jay Dee)
PS: Guys, I don't accept script helps or requests personally anymore.
==================================
User avatar
Sir_Fz
Revered One
Posts: 3794
Joined: Sun Apr 27, 2003 3:10 pm
Location: Lebanon
Contact:

Post by Sir_Fz »

Now that's a different story, it's not just checking soundex over nicknames - there's a dictionary involved and I agree that it would work but far too much effort would be done over a really not that big of a problem.
r
r0t3n
Owner
Posts: 507
Joined: Tue May 31, 2005 6:56 pm
Location: UK

Post by r0t3n »

And where do we find a soundex dictionary, or how make one. It will to too time consuming to a human to read and enter each word from a dictionary into soundex and store it in a file/db.
r0t3n @ #r0t3n @ Quakenet
User avatar
awyeah
Revered One
Posts: 1580
Joined: Mon Apr 26, 2004 2:37 am
Location: Switzerland
Contact:

Post by awyeah »

Ask user, he's the one who suggested it, so he would know better. I don't think you need to make a dictionary, thats obviously ridiculous.. maybe use one from a tcl library or maybe from the web.
·­awyeah·

==================================
Facebook: jawad@idsia.ch (Jay Dee)
PS: Guys, I don't accept script helps or requests personally anymore.
==================================
Post Reply