This is the new home of the egghelp.org community forum.
All data has been migrated (including user logins/passwords) to a new phpBB version.


For more information, see this announcement post. Click the X in the top right-corner of this box to dismiss this message.

file extension and bugmenot

Requests for complete scripts or modifications/fixes for scripts you didn't write. Response not guaranteed, and no thread bumping!
Post Reply
t
testebr
Halfop
Posts: 86
Joined: Thu Dec 01, 2005 12:22 pm

file extension and bugmenot

Post by testebr »

1 - retrieve file extension from www.filext.com

2 - search pass site in www.bugmenot.com
User avatar
rosc2112
Revered One
Posts: 1454
Joined: Sun Feb 19, 2006 8:36 pm
Location: Northeast Pennsylvania

Post by rosc2112 »

On the files extension site, how much data do you want returned? First match? All matches? Could be quite lengthy.. Hmm, doing a search for com returns 7 matches, and it's quite messy. Tell me what data you want returned.

As far as the bmn site, sure, that should be easy enough. Just give me some example site names (real ones) so I can test the html/regexp's, the only site I can think of offhand is naplesdailynews.com. I suppose there should also be a limit on number of results returned for that too, but I can make that a config option.
t
testebr
Halfop
Posts: 86
Joined: Thu Dec 01, 2005 12:22 pm

Post by testebr »

exact match.

Domains to test:

dreamcam.com.br
uol.com.br
playboy.com
sexyclube.com.br
User avatar
rosc2112
Revered One
Posts: 1454
Joined: Sun Feb 19, 2006 8:36 pm
Location: Northeast Pennsylvania

Post by rosc2112 »

They're all exact matches, what I meant was, what data do you want returned? Just the "Program and/or Extension Function" field? If you want ALL matches and ALL fields, it can end up returning hundreds of lines.

Look at: http://filext.com/file-extension/com for example.
t
testebr
Halfop
Posts: 86
Joined: Thu Dec 01, 2005 12:22 pm

Post by testebr »

"Program and/or Extension Function" field is the best solution.

Or no?
User avatar
rosc2112
Revered One
Posts: 1454
Joined: Sun Feb 19, 2006 8:36 pm
Location: Northeast Pennsylvania

Post by rosc2112 »

Sounds good to me, I'll also include the urls in the output for more details.. Give me a few days to work on the 2 scripts.
User avatar
rosc2112
Revered One
Posts: 1454
Joined: Sun Feb 19, 2006 8:36 pm
Location: Northeast Pennsylvania

Post by rosc2112 »

filext.com script is done:

http://forum.egghelp.org/viewtopic.php?p=72108

I'll work on bugmenot later.
t
testebr
Halfop
Posts: 86
Joined: Thu Dec 01, 2005 12:22 pm

Post by testebr »

very nice, work fine here.

when I grow, I want to be same you :P
User avatar
rosc2112
Revered One
Posts: 1454
Joined: Sun Feb 19, 2006 8:36 pm
Location: Northeast Pennsylvania

Post by rosc2112 »

Only takes a few months of rtfm :)

I'll let you figure out the bmn script, its not hard, use the filext script as a general guide to scraping webpages for content. If you need help, ask in the script help forum. About all you need to change is the url, the regexp and the particular format for the output.

The url's bugmenot uses take the form:
# http://www.bugmenot.com/view/hostname

Look through the html source from the resulting pages and find what's common to all the output you're looking for and grab it with regexp. Have fun reading that manpage :)
Post Reply