Seedlists

Working

These seed lists should be available and fresh. If one is not, please move it to Not Working.

hardcoded:

http://home.arcor.de/hermens/yacy/seed.txt
http://low.audioattack.de/yacy/seed.txt
http://www.lulabad.de/seed.txt
http://www.yacy.net/seed.txt

dynamic:

http://www.anomic.de/yacy/seed.txt
http://www.suma-lab.de/yacy/seed.txt
http://www.pruefziffernberechnung.de/YaCy/seedFile.txt
http://home.arcor.de/tim.stein/yacy/127001-2.txt
https://www.ghcif.de/~yacy/seed.txt
http://www.daburna.de/YaCy/seed.txt
http://www.bachhausen.de/wohnseed.txt
http://www.bachhausen.de/seed.txt

Seeds for Tor (tor.eff.org) can be found on De:YaCy-Tor.

Not Working

Seed lists that are currently outdated or unavailable.

http://yacy-websuche.mxchange.org/seeds/seed.txt  ==> redirects a user to a website
http://www.mcmilk.de/seed.txt ==> 404
http://mxchange.org/seeds/free-search.txt ==> 404
http://yacy.kicks-ass.net:8080/repository/seed.txt ==> Server not found
http://cowpuncher.drollette.com/seed.txt ==> 404
http://mirror.hfase.com/yacy/seed.txt ==> Server not found
http://85.214.248.55:8090/www/seed.txt ==> Could not connect
http://reezer.org/yacy/seed.txt ==> exists, but outdated (last updated January 4, 2010, as of January 27)
http://projekte.newsfreak.de/yacy/seed.txt ==> exists, but badly outdated (per e-mail from the webmaster)
http://www.krautter-consulting.de/seed/seed.txt ==> exists, but badly outdated (per e-mail from the webmaster)
http://trumpkin.de/yacy/seed.txt ==> exists, but badly outdated (seed.txt dated 30 June 2008)

mod_rewrite

Some search engines index seed files even when robots.txt excludes them. To keep them from indexing the list, you can redirect every client other than YaCy that requests the seed file. This requires mod_rewrite on Apache and the following entries in your .htaccess file:

RewriteEngine on
# Redirect every client whose User-Agent does not start with "yacy" (case-insensitive)
RewriteCond %{HTTP_USER_AGENT} !^yacy [NC]
RewriteRule seed\.txt$ http://www.yacy.net/ [R,L]

Keep in mind that with this rule in place, only clients that identify themselves as YaCy can download the seed file.
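
To get a feel for which clients the condition lets through, the same pattern can be tried locally before touching the server. The following is only a sketch: ua_allowed is a hypothetical helper name, and the sample User-Agent strings in the usage note are made up for illustration. It mirrors the rule by checking case-insensitively whether the string starts with "yacy".

```shell
#!/bin/sh
# Hypothetical helper (not part of YaCy or Apache): succeeds when the given
# User-Agent string starts with "yacy", compared case-insensitively like the
# [NC] flag. Such clients are served the seed file; all others are redirected.
ua_allowed() {
    printf '%s\n' "$1" | grep -qiE '^yacy'
}
```

For example, `ua_allowed "yacy (example build)" && echo served` prints "served", while `ua_allowed "Mozilla/5.0" || echo redirected` prints "redirected". A User-Agent header is trivial to fake, so the rule only keeps out well-behaved crawlers.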

How to get a fresh list of seedfiles (in Linux)

You can parse the Network.html?page=1 page of your own (or another) peer to get an up-to-date list of seed files, e.g. from a script:

lynx --source "http://<peer>:<port>/Network.html?page=1"|grep "Type: Principal"|cut -d '"' -f 4

For example, to build a simple link checker for seed files:

lynx --source "http://<peer>:<port>/Network.html?page=1" | grep "Type: Principal" | cut -d '"' -f 4 | while read -r link; do
    lynx --dump "$link" > /dev/null 2>&1 && echo "[OK] $link" || echo "[ERROR] $link"
done
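
The checker can also be written as a small reusable function. This is a sketch under a few assumptions: curl is installed, and fetch_url and check_seed_urls are hypothetical names chosen for this example. curl's -f option makes HTTP error responses such as 404 count as failures, and --max-time keeps an unreachable server from stalling the loop.

```shell
#!/bin/sh
# Hypothetical helper: succeeds only if the URL can be downloaded.
# Assumes curl is installed; -f treats HTTP errors (e.g. 404) as failures,
# --max-time 10 gives up after ten seconds.
fetch_url() {
    curl -fsS --max-time 10 -o /dev/null "$1" 2> /dev/null
}

# Reads one URL per line from stdin and prints a status line for each.
check_seed_urls() {
    while IFS= read -r link; do
        [ -n "$link" ] || continue          # skip empty lines
        # Redirect stdin so the fetch cannot swallow the remaining URLs.
        if fetch_url "$link" < /dev/null; then
            echo "[OK] $link"
        else
            echo "[ERROR] $link"
        fi
    done
}
```

It takes the URL list on standard input, so the output of the lynx/grep/cut pipeline above can be piped straight into check_seed_urls, as can a plain text file with one seed URL per line.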