Relevance and Privacy Improvements to the YaCy Decentralized Web Search Engine
dc.contributor.advisor | Hougen, Dean | |
dc.contributor.author | Rand, Jeremy | |
dc.contributor.committeeMember | Cheng, Qi | |
dc.contributor.committeeMember | Radhakrishnan, Sridhar | |
dc.date.accessioned | 2018-05-11T16:04:23Z | |
dc.date.available | 2018-05-11T16:04:23Z | |
dc.date.issued | 2018 | |
dc.date.manuscript | 2018 | |
dc.description.abstract | The YaCy decentralized web search engine carries significant potential advantages in censorship resistance over centralized search engines such as Google. However, YaCy currently suffers from deficiencies in relevance of results as well as weaknesses in privacy. We have developed improvements to YaCy's relevance, including tools to generate a ranking dataset that can be fed to machine learning algorithms, fixes for some significant YaCy flaws that severely damaged ranking, and tools for ensuring that the decentralized index contains relevant results. We have also conducted an initial privacy audit of YaCy's usage of anonymizing proxies and YaCy's application-layer protocol, with recommendations for improving YaCy's privacy in both areas. We believe that this work helps pave the way for YaCy to become a credible competitor to centralized search engines. We expect future work to experiment with various machine learning implementations using our ranking dataset generation toolset, as well as implementing the improvements recommended by our initial privacy audit and conducting more extensive privacy audits once our initial recommendations are implemented. | en_US |
dc.identifier.uri | https://hdl.handle.net/11244/299892 | |
dc.language | en_US | en_US |
dc.subject | decentralization | en_US |
dc.subject | search engines | en_US |
dc.subject | machine learning | en_US |
dc.subject | anonymity | en_US |
dc.thesis.degree | Master of Science | en_US |
dc.title | Relevance and Privacy Improvements to the YaCy Decentralized Web Search Engine | en_US |
ou.group | College of Engineering::School of Computer Science | en_US |
Files
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: