| Advanced internet searching strategies & "wizard seeking" tips by fravia+, 28 December 2004, Version 0.023 This file dwells @ http://www.searchlores.org/berlin2004.htm Introduction Scaletta of this session A glimpse of the web? Searching for disappeared sites Many rabbits out of the hat   Slides Some reading material An assignment Nemo solus satis sapit {Bk:flange of myth } {rose.htm} |
This document is listing a palette of possible points to be discussed is my own in fieri contribution to the 21C3 ccc's event (December 2004, Berlin). The aim of this workshop is to give European hackers some "cosmic" searching power, because they will need it badly when (and if) they will wage battle against the powers that be. The ccc-friends in Berlin have insisted on a "paper" to be presented before the workshop, which isn't easy, since a lot of content may depend on the kind of audience I'll find: you never know, before, how much web-savvy (or clueless) the participants will be... usually you just realize it during (or after) a session. Hopefully, a European hacker congress will allow some (more) complex searching techniques to be discussed. Anyway the real workshop will probably differ a lot from this list of points, techniques and aspects of web-searching that need to be explained -again and again- if we want people to understand that seeking encompasses MUCH MORE than just using the main search engines ā la google, fast or inktomi with "one-word" simple queries. I have kept this document on a rather schematic plane: readers will at least be able to read this file before the workshop itself,which may prove useful: in fact there are various things to digest even during such a short session, and many lore will remain uncovered. The aim is anyway to point readers towards non-commercial working approaches and possible solutions; above all, to enable them to find more (sound) material by themselves on the deep web of knowledge. If you learn to search the web well, you won't need nobody's workshops anymore :-) Keep an eye on this URL, especially if you do not manage to come to Berlin... It may even get updated :-) |
Gutenberg's Database
search
Search by Author or Title. For more guidance, see the
Advanced Search page,
where you can specify language, topic and more.
| (Leading zeroes MUST be MANUALLY added) |
#mysql dump filetype:sql
AIM buddy lists
allinurl:/examples/jsp/snp/snoop.jsp
allinurl:servlet/SnoopServlet
cgiirc.conf
cgiirc.conf
filetype:conf inurl:firewall -intitle:cvs
filetype:eml eml +intext:"Subject" +intext:"From" +intext:"To"
filetype:lic lic intext:key
filetype:mbx mbx intext:Subject
filetype:wab wab
Financial spreadsheets: finance.xls
Financial spreadsheets: finances.xls
Ganglia Cluster Reports
generated by wwwstat
haccess.ctl
haccess.ctl
Host Vulnerability Summary Report
HTTP_FROM=googlebot googlebot.com "Server_Software="
ICQ chat logs, please...
Index of / "chat/logs"
intext:"Tobias Oetiker" "traffic analysis"
intitle:"index of" mysql.conf OR mysql_config
intitle:"statistics of" "advanced web statistics"
intitle:"Usage Statistics for" "Generated by Webalizer"
intitle:"wbem" compaq login
intitle:admin intitle:login
intitle:index.of "Apache" "server at"
intitle:index.of cleanup.log
intitle:index.of dead.letter
intitle:index.of inbox
intitle:index.of inbox dbx
intitle:index.of ws_ftp.ini
inurl:"newsletter/admin/"
inurl:"newsletter/admin/" intitle:"newsletter admin"
inurl:"smb.conf" intext:"workgroup" filetype:conf conf
inurl:admin filetype:xls
inurl:admin intitle:login
inurl:cgi-bin/printenv
inurl:changepassword.asp
inurl:fcgi-bin/echo
inurl:main.php phpMyAdmin
inurl:main.php Welcome to phpMyAdmin
inurl:perl/printenv
inurl:server-info "Apache Server Information"
inurl:server-status "apache"
inurl:tdbin
inurl:vbstats.php "page generated"
ipsec.conf
ipsec.secrets
ipsec.secrets
Most Submitted Forms and Scripts "this section"
mt-db-pass.cgi files
mystuff.xml - Trillian data files
Network Vulnerability Assessment Report
not for distribution confidential
phpinfo.php
phpMyAdmin "running on" inurl:"main.php"
phpMyAdmin dumps
phpMyAdmin dumps
produced by getstats
Request Details "Control Tree" "Server Variables"
robots.txt
robots.txt "Disallow:" filetype:txt
robots.txt "Disallow:" filetype:txt
Running in Child mode
site:edu admin grades
SQL data dumps
Squid cache server reports
Thank you for your order +receipt
This is a Shareaza Node
This report was generated by WebLog
