This is the new home of the egghelp.org community forum.
All data has been migrated (including user logins/passwords) to a new phpBB version.


For more information, see this announcement post. Click the X in the top right-corner of this box to dismiss this message.

UNOFFICIAL incith-google 2.1x (Nov30,2o12)

Support & discussion of released scripts, and announcements of new releases.
Post Reply
User avatar
speechles
Revered One
Posts: 1398
Joined: Sat Aug 26, 2006 10:19 pm
Location: emerald triangle, california (coastal redwoods)

Post by speechles »

Download: Incith:Google v1.9.9m

New version but... this one WON'T fix every issue people are having regarding inconsistent results with "weather" and "calculator" within !google itself. What it does do is fix literally a ton of things that have broken over the course of the last month. This is the _EXACT_ version I am using myself. When I have more time I'll try to fix the problem with "weather/calculator"results. Until then consider this something to hold everybody over until then.

If you have _ANY_ issues, broken parsers, missing output, etc please feel free to shout it out here. If it has to do with google's "weather/calculator" results keep in mind this is already on the table and going to be addressed shortly. ;)
User avatar
Anahel
Halfop
Posts: 48
Joined: Fri Jul 03, 2009 6:18 pm
Location: Dom!

Post by Anahel »

i've other problem with your google script (im using the newest "m" version)

when i use !trans with japanese characters bot send [censored] to channel:

Code: Select all

<&maruda> !trans jp@en  オタク
<~Lilith> Google says: (ja->en) Translation: Japanese » English >> ŞŻGerman German German circumstances
and heres google:

Code: Select all

<tomek> !g オタク
<~Lilith> 7,360,000 Results | ĄŞ? Ą?? §?????$ @ http://jahjahsphinx.blogspot.com/ | ĄŞ? Ą?? §?????$ @ http://jahjahsphinx.blogspot.com/2009/08/blog-post.html | Ordinal indicator - Wikipedia, the free @ http://en.wikipedia.org/wiki/Ordinal_indicator | [PDF] : Ş " " Ş ˙ . . : Ş : 1. . 2 @ http://www.bibalex.org/ARABIC/Egyptology/images/coptlanglect.pdf
maybe i've something unset with charset (i havent changed anything about it - only changes i've made was set wiki to .pl language, even google is set to .com)

could you help me?
User avatar
spithash
Master
Posts: 248
Joined: Thu Jul 12, 2007 9:21 am
Location: Libera
Contact:

Post by spithash »

I don't think this is an issue of the script, it works just awesome,

I believe its a UTF fonts issue.
Libera ##rtlsdr & ##re - Nick: spithash
Click here for troll.tcl
g
gh0st
Voice
Posts: 2
Joined: Thu Aug 20, 2009 3:45 pm

Post by gh0st »

i got this error since 2 days with the weather script. im using incith-google-v1.99m

Code: Select all

Tcl error [incith::google::public_message]: can't read "f2": no such variable
User avatar
speechles
Revered One
Posts: 1398
Joined: Sat Aug 26, 2006 10:19 pm
Location: emerald triangle, california (coastal redwoods)

Post by speechles »

Anahel wrote:i've other problem with your google script (im using the newest "m" version)

when i use !trans with japanese characters bot send [censored] to channel:

Code: Select all

<&maruda> !trans jp@en  オタク
<~Lilith> Google says: (ja->en) Translation: Japanese » English >> ŞŻGerman German German circumstances
Trans has issues regarding eggdrop and how it improperly handles utf-8. Also, http-package has large flaws which this script presently only fixes slightly (See here for http-packages flaw).
Anahel wrote:and heres google:

Code: Select all

<tomek> !g オタク
<~Lilith> 7,360,000 Results | ĄŞ? Ą?? §?????$ @ http://jahjahsphinx.blogspot.com/ | ĄŞ? Ą?? §?????$ @ http://jahjahsphinx.blogspot.com/2009/08/blog-post.html | Ordinal indicator - Wikipedia, the free @ http://en.wikipedia.org/wiki/Ordinal_indicator | [PDF] : Ş " " Ş ˙ . . : Ş : 1. . 2 @ http://www.bibalex.org/ARABIC/Egyptology/images/coptlanglect.pdf
maybe i've something unset with charset (i havent changed anything about it - only changes i've made was set wiki to .pl language, even google is set to .com)

could you help me?

Code: Select all

    # encoding conversion lookups
    # here is where you can correct language encoding problems by pointing their
    # abbreviation towards an encoding. if you want more, feel free to add more.
    # this is a somewhat poor example below, there are possibly hundreds of additions
    # that need to be added to this section, this is just something to see if this
    # indeed is a work around, enjoy and merry christmas ;P
    # ------
    variable encode_strings {
      zh:gb2312
      de:iso8859-1
      es:iso8859-1
      it:iso8859-1
      nl:iso8859-1
     com:iso8859-1
   co.uk:iso8859-1
      en:iso8859-1
      fr:iso8859-1
      ro:cp1251
      bg:cp1251
      rs:cp1251
      sr:cp1251
   sr-el:cp1252
      ru:cp1251
      ar:cp1256
      jp:shiftjis
      ja:shiftjis
   co.jp:shiftjis
      tr:cp857
      kr:iso2022-kr
   co.kr:iso2022-kr
   co.il:iso8859-8
  com.ua:koi8-u
      uk:koi8-u
      hu:cp1250
      pl:iso8859-2
    }
This is where you set the encodings for each TLD (or in the case of wikimedia, the entire website domain) listed. if this list has any incorrect encoding listed next to any region/site this is up to each user to fix individually. This list is by no means, complete, accurate, nor guaranteed. Together with using the list and reading the output from debug mode of the script you should be able to figure out most of it. You might also use webby to visit the same url the bot has and see if webby can parse the page correctly. Webby features advanced encoding detection, reporting, and conflict resolution. So it's display should always be accurate and when it isn't append --swap to your url and it will resolve conflicts inversely. This isn't just a shameless plug to promote webby, this is me admitting that it's so useful to me everyday that I find it indispensable. ;)

If you need any of the config settings explained, you should ask how to use them or actually read the comments completely and I pretty much explain how everything operates. If you have any questions on any of this, it isn't dumb to ask a question. It implies you have read the comments and need clarfication.
Last edited by speechles on Thu Aug 20, 2009 6:14 pm, edited 1 time in total.
User avatar
speechles
Revered One
Posts: 1398
Joined: Sat Aug 26, 2006 10:19 pm
Location: emerald triangle, california (coastal redwoods)

Post by speechles »

gh0st wrote:i got this error since 2 days with the weather script. im using incith-google-v1.99m

Code: Select all

Tcl error [incith::google::public_message]: can't read "f2": no such variable
Doh! Why didn't you say something two days ago?! HAW :lol:

but on a more serious note, will have this fixed shortly. Watch for this post to be editing soon with a new version fixing the issue with weather, as well as fixing a handful of other bugs experienced.

Edit:

Well, how does this look? I think this fixes everything... ;)

Code: Select all

#   .9n: Corrected google weather and cruft html elements within#
#          some google no-results-found messages.               #
<speechles> !g .de wetter berlin
<sp33chy> Wetter für Berlin: 24 °C, Aktuell: Klar, Wind: SO mit 11 km/h, Feuchtigkeit: 44 %; Forecast: Do., Sunny (31°C|18°C); Fr., Chance of storm (29°C|14°C); Sa., Mostly sunny (23°C|10°C)
<speechles> !g .nl weer amsterdam
<sp33chy> Weer voor Amsterdam: 19 °C, Actueel: Plaatselijk bewolkt, Wind: W - 16 km/h, Luchtvochtigheid: 63%; Forecast: do, Chance of storm (29°C|15°C); vr, Chance of rain (21°C|13°C); za, Chance of rain (21°C|12°C)
<speechles> !g weather san francisco, california
<sp33chy> Weather for San Francisco, California: 67°F, Current: Partly Cloudy, Wind: NW at 6 mph, Humidity: 73%; Forecast: Thu, Mostly sunny (63°F|54°F); Fri, Mostly sunny (67°F|56°F); Sat, Mostly sunny (67°F|54°F)
Incith:Google v1.9.9n. Enjoy and as always, have a fun ;P

Will correct ebay and it's inconsistant shipping display for .com results shortly.. expect to see .o version at that time ;P
g
gh0st
Voice
Posts: 2
Joined: Thu Aug 20, 2009 3:45 pm

Post by gh0st »

hey thank you! :thumps up:
User avatar
speechles
Revered One
Posts: 1398
Joined: Sat Aug 26, 2006 10:19 pm
Location: emerald triangle, california (coastal redwoods)

Post by speechles »

-- singapore
<speechles> !e .com.sg bubble gum
<sp33chy> 19 results | Vincci Bubblegum Pop Colorful Opentoe He, S$ 9.90(+S$ 2.50), 0 Bids, 8h 25m @ http://cgi.ebay.com.sg/_W0QQItemZ250483340761 | BN Bonne Bell Bubble Gum Lip Smacker Glo, S$ 4.50(Free), Buy It Now, 2h 45m @ http://cgi.ebay.com.sg/_W0QQItemZ250483289799 | Japanese Animation - Bubblegum Crisis To, S$ 3.00(+S$ 2.00), 0 Bids, 1d 3h 27m @ http://cgi.ebay.com.sg/_W0QQItemZ270444229706

-- malaysia
<speechles> !e .com.my bride
<sp33chy> 148 results | BRAND NEW The Petrakos Bride by Lynne Gr, RM 18.00(+RM 4.00), 0 Bids, 3h 9m @ http://cgi.ebay.com.my/_W0QQItemZ380149390697 | NEW The Italian's Future Bride by Michel, RM 18.00(+RM 4.00), 0 Bids, 3h 17m @ http://cgi.ebay.com.my/_W0QQItemZ380149391616 | NEW The Ranieri Bride by Michelle Reid, RM 18.00(+RM 4.00), 0 Bids, 3h 18m @ http://cgi.ebay.com.my/_W0QQItemZ380149391840

-- germany
<speechles> !e .de eier
<sp33chy> 45.236 Ergebnisse | WAHNSINNNNNN zwischen 2500 und 2700 Hpf, EUR 20,50(+EUR 15,00), 11 Gebote, 6T 16Std 38Min @ http://cgi.ebay.de/_W0QQItemZ230368738997 | O EI A Jubelausgabe 2003-04, EUR 1,00(+EUR 4,90), 0 Gebote, 25Min @ http://cgi.ebay.de/_W0QQItemZ350241866986 | Ü-Ei Figur Sonne, Mond und Sterne Stern, EUR 1,00(+EUR 2,00), 0 Gebote, 41Min @ http://cgi.ebay.de/_W0QQItemZ130324363220

-- united states
<speechles> !e .com eggs
<sp33chy> 18,628 results | 155.0ct RUBY AMETHYST GARNET BT STERLING, $202.50(), 10 Bids, 6d 17h 26m @ http://cgi.ebay.com/_W0QQItemZ310162811776 | Little Giant Egg Incubator w/ Forced Air, $142.95(Free shipping), Buy It Now, 26d 11h 41m @ http://cgi.ebay.com/_W0QQItemZ330353195089 | Ped Egg Professional Pedicure & 3 File B, $7.49(Free shipping), Buy It Now, 8d 17h 23m @ http://cgi.ebay.com/_W0QQItemZ260465595310

-- france
<speechles> !e .fr pink
<sp33chy> 24 022 résultats | 6 ÉPINGLES À CHEVEUX MARIÉE MARIAGE PART, 4,99 EUR(+3,99 EUR), 0 Enchère, 6m @ http://cgi.ebay.fr/_W0QQItemZ140339874791 | BONNET DE NAISSANCE "J'AIME PAPA" - ROSE, 3,50 EUR(+1,88 EUR), Achat immédiat, 8j 7h 17m @ http://cgi.ebay.fr/_W0QQItemZ200375358198 | GANTS SANS DOIGT MARIÉE MARIAGE PARTIE ~, 4,99 EUR(+3,99 EUR), 0 Enchère, 7m @ http://cgi.ebay.fr/_W0QQItemZ140339874835
Shipping and all parsing is now detected perfectly (an empty shipping field indicates just that this isn't a bug). This uses several different templates and schemes to extract information from almost every regional ebay server you feed it. The ones above are merely examples demonstrating this behavior. There are several more you can use as well somewhere around 20ish or so. This update has also corrected the bug within google video display causing things to run together.

Incith:Google v1.9.9o .. Enjoy everything now works as intended... :D
User avatar
speechles
Revered One
Posts: 1398
Joined: Sat Aug 26, 2006 10:19 pm
Location: emerald triangle, california (coastal redwoods)

Post by speechles »

Had some time to investigate google translations and it's total lack of understanding encodings and seem to have stumbled upon an answer of sorts.

Code: Select all

    # Google translation input/output encodings
    # set these below to either:
    # 0 for automatic encoding
    # 1 for utf-8
    # You may use switches to inverse this behavior on-the-fly. Set them
    # below as well.
    # ------
    variable trans_input 1
    variable trans_output 0
    variable trans_input_t "--swapi"
    variable trans_output_t "--swapo"
With these options you can now control how encodings for google translations are seen both as input and output. Using 0 will automatically detect, while using 1 will strictly assume utf-8. This is all fine and dandy until you discover the problem. It would be nice to inverse this, maybe on the fly for the few languages which don't work quite as expected and so people may experiment on their own. Well now this is possible, you can modify any of these attributes using the 2 switches you create within trans_input_t and trans_output_t.
Using 1 for input encoding seems to solves the accented character problem as shown below:

<speechles> !tr Google Traducciones parece odiar eggdrops por alguna razón
<sp33chy> Google says: (auto->en) Translation: Galician (automatically detected) » English >> Google seems to hate eggdrops translations for some reason
<speechles> !tr Google Traducciones parece odiar eggdrops por alguna razón --swapi
<sp33chy> Google says: (auto->en) Translation: Galician (automatically detected) » English >> Google translations seem to hate any race for eggdrops § Ã £ o

Now the problem with using 1 for output encoding is that it doesn't seem to work for all languages. This is probably a direct result of eggdrop not being able to handle utf-8 correctly which 1 means.. But, you do have the ability to inversely affect the output encodings as shown below:

<speechles> !tr Google Traducciones parece odiar eggdrops por alguna razón --swapo
<sp33chy> Google says: (auto->en) Translation: Galician (automatically detected) » English >> Google seems to hate eggdrops translations for some reason

Keep in mind you can place these "switches" anywhere in the string and they will be properly detected and removed. So you must be careful what you name them to keep them from being seen as of any text you may wish to translate. The debug output during these queries (if enabled and debugnick is set correctly) will clearly indicate which encodings are being used each time.
The problem here is that choosing utf-8, which --swapo does in this case (using default options shown above), there is no notable difference between the automatic (iso8859-1, in this case) representation and the utf-8 one. Not until you start to deal with accented characters and eggdrop mishandles things. If your bot is patched with either the suzi or utf-8 patch this may work wonders now for you, as every query within google now embeds the encoding within the query line. This wasn't happening before and was the reason for alot of the problems. Utf-8 folks should now be alot happier. Even if you aren't using utf-8, this works much better than any previous version...

Incith:Google v1.9.9p .. Have a fun :lol:
User avatar
Anahel
Halfop
Posts: 48
Joined: Fri Jul 03, 2009 6:18 pm
Location: Dom!

Post by Anahel »

hmm i've installeg the "p" version and still have issues with japanese language, i've tested polish too and there's a problem too:

xchat:

Code: Select all

<tomek> !tr pl@en liść
<~Lilith> Google says: (pl->en) Translation: Polish » English >> If [?
<tomek> !tr ja@en 私
<~Lilith> Google says: (ja->en) Translation: Japanese » English >> Á
mirc:

Code: Select all

<&anahel> !tr pl@en gość
<~Lilith> Google says: (pl->en) Translation: Polish » English >> it [?
<&anahel> !tr jp@en ポオランド
<~Lilith> Google says: (ja->en) Translation: Japanese » English >> ÝŞ éóÉ
so there's still an issue with iso-8859-2 (polish) and (shifts something and dont remember jap charset, later i've changed it to utf-8 which works much better than original)

and heres test of you example:

Code: Select all

<tomek> !tr Google Traducciones parece odiar eggdrops por alguna razón
<~Lilith> Google says: (auto->en) Translation: Galician (automatically detected) » English >> Google seems to hate eggdrops translations for some reason


Image
t
transacid
Voice
Posts: 12
Joined: Wed Aug 08, 2007 2:15 pm
Location: Hamburg / Germany
Contact:

Post by transacid »

apparently there is something wrong with the parser. (besides the wrong usage but this shouldn't happen at all IMHO)
the query:
[21:36:14] ( transacid) !g hl:de define:potz
the output:
Image

another thing:

Code: Select all

[22:27:15] ( transacid) !w plenk
[22:27:16] ( NeXuS) Wikimedia Error: Unable to parse for: plenk @ 
                    http://en.wikipedia.org/wiki/index.php?title=Special%3ASearch&search=plenk&fulltext=Search
User avatar
speechles
Revered One
Posts: 1398
Joined: Sat Aug 26, 2006 10:19 pm
Location: emerald triangle, california (coastal redwoods)

Post by speechles »

transacid wrote:apparently there is something wrong with the parser. (besides the wrong usage but this shouldn't happen at all IMHO)
It's bound to happen eventually. It's simply the nature of web scraping. Html changes with the wind. In a perfect (read this as free API) world we wouldn't need to screen/web scrape a thing. But this world we live in is far from that... ;)
another thing:
transacid wrote:

Code: Select all

[22:27:15] ( transacid) !w plenk
[22:27:16] ( NeXuS) Wikimedia Error: Unable to parse for: plenk @ 
                    http://en.wikipedia.org/wiki/index.php?title=Special%3ASearch&search=plenk&fulltext=Search
Note: This is actually a bug within wikipedia/wikimedia sites themselves, not anything lacking in the script. The problem has to do with problematic squid caches returning outdated incorrect replies. See here for more info. ;)

This occurs because that page just so happens to be gzipped. This script expects text data to flow. Not binary compressed data. It's no longer an issue though because I've incorporated support for the zlib package within the script now. It will seamlessly decompress those pesky gzipped wikipedia/media pages easily.

http://wiki.tcl.tk/4610 - this url explains most of what is now required. And by required, I mean if you don't need gzip for your wiki pleasures then by all means it's not required. But.. If you experience any issue like above, your going to need it.

Windrop users without tcl8.6 handy (which includes myself :D) can simply use the url below to download a precompiled binary package, copying the entire folder to their windrop/lib/tcl folder gets you going.
http://pascal.scheffers.net/software/zl ... -win32.zip

Eggdrop users, on actual eggdrops running flavors of *nix can simply rely on trf package to work the gzip magic. This is to allow future versions of tcl (zlib) to work as well as the older (trf).

Now for the new additions to the script:

Code: Select all

    # set this to the proxy you wish the main html fetching procedure to
    # use. set both proxy_host and proxy_port. if you don't wish to use
    # a proxy. set proxy_host ""
    # --
    variable proxy_host ""
    variable proxy_port ""

    # set this to the switch you want to use for youtube to filter content
    # and display only high-definition videos.
    # ------
    variable youtube_highdef "@hd"
Those previously getting 503 google sorry messages will now be much happier. There is now a proper proxy system present within the script as well. You can also now search youtube and filter high-definition videos only. This switch is customizeable as well and can be placed anywhere (except before the .region switch).
!y @hd .de whatever - Bad, the high-def switch is placed before the region.
!y .de whatever @hd / !y .fr @hd whatever - both 100% acceptable.
<speechles> !wm .en.wikipedia.org/wiki Plenk
<sp33chy> Plenken | Plenken is a German typographical term for the insertion of inappropriate spaces before a punctuation mark. Its counterpart is Klempen, the incorrect omission of a space after punctuation marks. @ http://en.wikipedia.org/wiki/Plenken

<speechles> !w Plenk
<sp33chy> Plenken | Plenken is a German typographical term for the insertion of inappropriate spaces before a punctuation mark. Its counterpart is Klempen, the incorrect omission of a space after punctuation marks. @ http://en.wikipedia.org/wiki/Plenken

<speechles> !g hl:de define:potz
<sp33chy> 549 Results | Did you mean: hl:dt define:putz | Urban Dictionary: potz @ http://www.urbandictionary.com/define.php?term=potz | Specificity of antibodies produced again @ http://www.pubmedcentral.nih.gov/articl ... tid=237545 | Mutations within the proteolytic cleavag @ http://www.pubmedcentral.nih.gov/articl ... tid=240787
Edit: While on the subject of gzipped webpages. Why not incorporate this bandwidth saving feature into the script. This has now been done. If you have zlib or trf packages handy on your bot this script will now use them and save bandwidth using it for every query it makes. In my tests this seems to speed up page loading considerably. It is not a requirement to use zlib or trf packages, the script will still function without them. But for those with it, you will notice better response times. ;)
Download: Incith:Google v1.9.9r
<speechles> !webby http://en.wikipedia.org/wiki/index.php? ... ext=Search --gz
<sp33chy> plenk - Search results - Wikipedia, the free encyclopedia ( http://u.nu/67t73 )( 200; text/html; utf-8; 9576 bytes (gzip); 23390 bytes )
As you can see above using webby to illustrate. Simply gzipping the query arrives at 9576 bytes. While the entire html would've been 23390 bytes. This equates to quite a savings over time. Enjoy, and most important. Have a fun ;P
g
glitchiker
Voice
Posts: 1
Joined: Wed Aug 26, 2009 7:12 pm

Post by glitchiker »

zlib is now a part of tcl 8.6 for those of you / us who are eggdrop users.

If you are getting this error, it may be because zlib is not being called correctly in the script due to the version of tcl you may have. Versions 8.6b1.1 going forward all now have zlib packaged within tcl 8.6.

Error:
Incith:Google : Unable to find zlib package. Wikipedia and Wikimedia require gzip decompression. As a result, every article will not work for you.
The shell provider and I worked on the issue and came up with this solution. Alter the code as needed.

*To find out what tcl packages are being used on your host use the following command:
pkg_info | grep tcl
For tcl 8.6b1.1 users and below. "DO NOT ALTER THIS CODE":
if {[catch {package require zlib} error] !=0} {
Anyone who is using tcl 8.6b1.1 and above, go ahead and modify the previous line to suit the following:
if {[lsearch [info commands] zlib] == -1 && [catch {package require zlib} error] !=0} {
Shell provider: "Just tell him that the zlib package is now a part of tcl and does not need to be loaded".

This fix may only be temporary until speechles does something new, and if none of this works, you may have to stay with the frontend script http 2.5. Make sure the script is loaded before incith-google*.

You can get the script from here: http://incith.com/eggdrop/http.tcl

Also you may have to comment the zlib section of the script out.
#if {[catch {package require zlib} error] !=0} {
#putlog "Incith:Google : Unable to find zlib package. Wikipedia and Wikimedia require gzip decompression. As a result, every article will not work for you."
#set incith_nozlib 1
#set incith_hdr ""
#} else {
set incith_hdr "Accept-Encoding gzip,deflate"
#}
Another side note. If you are compiling eggdrop-1.6.19 on tcl 8.6* you may have some difficulty unless you hack into the compilation process first. Add the following code to config.h in between ./configure and make config.
#define USE_INTERP_RESULT 1
Enjoy,

glitch

8)
T
TehJibba
Voice
Posts: 1
Joined: Sat Sep 19, 2009 5:08 pm

Post by TehJibba »

Youtube changed its site code again and is the only part of the new version that doesnt work for me. Was hoping for maybe an updated snippet or workaround possibly
s
shadrach
Halfop
Posts: 74
Joined: Fri Dec 14, 2007 6:29 pm

Post by shadrach »

'locate' seems not to be working.
Post Reply