Website Search Results
-
Sometimes You Feel Like a Nutch: The Un-Googlification of a Library Search Service - Bitstreams: The
https://blogs.library.duke.edu/bitstreams/2021/11/05/sometimes-you-feel-like-a-nutch-the-un-googlification-of-a-library-search-service/
Apache Nutch is open source web crawler software written in Java. It’s been around for nearly 20 years, almost as long as Google.
-
New Framework for Search Results Page - Duke University Libraries Blogs
https://blogs.library.duke.edu/blog/2021/06/22/new-framework-for-search-results-page/
We have switched to a website search based on two open source tools, Nutch (a web crawler) and Solr (a search platform). Using (...)
-
Archiving Social Media about Duke Activism - The Devil's Tale
https://blogs.library.duke.edu/rubenstein/2016/04/25/archiving-social-media-duke-activism/
We primarily used three tools to collect web materials, each with its own strengths. The Rubenstein Library subscribes to the Internet (...)
-
Bitstreams: The Digital Collections Blog - Page 2 of 36 - Notes from the Duke University Libraries D
https://blogs.library.duke.edu/bitstreams/page/2/
Apache Nutch is open source web crawler software written in Java. It’s been around for nearly 20 years, almost as long as Google.
-
The Devil's Tale - Page 40 of 128 - Dispatches from the David M. Rubenstein Rare Book and Manuscript
https://blogs.library.duke.edu/rubenstein/page/40/
We primarily used three tools to collect web materials, each with its own strengths. The Rubenstein Library subscribes to the Internet (...)
-
The Devil's Tale - Page 42 of 130 - Dispatches from the David M. Rubenstein Rare Book and Manuscript
https://blogs.library.duke.edu/rubenstein/page/42/
We primarily used three tools to collect web materials, each with its own strengths. The Rubenstein Library subscribes to the Internet (...)
-
Rotten Links (Are Big Time-Sinks)
https://dukelawref.blogspot.com/2014/07/rotten-links-are-big-time-sinks.html
Cached versions of pages change frequently. To view versions of a web page which are older than available search engine caches, try the (...)