{"id":2222,"date":"2023-05-03T15:30:05","date_gmt":"2023-05-03T07:30:05","guid":{"rendered":"https:\/\/www.appblog.cn\/?p=2222"},"modified":"2023-05-04T05:45:01","modified_gmt":"2023-05-03T21:45:01","slug":"9-java-based-search-engine-frameworks","status":"publish","type":"post","link":"https:\/\/www.appblog.cn\/index.php\/2023\/05\/03\/9-java-based-search-engine-frameworks\/","title":{"rendered":"9 Java-Based Search Engine Frameworks"},"content":{"rendered":"<h2>Java Full-Text Search Engine Framework Lucene<\/h2>\n<p>Without a doubt, Lucene is currently the most popular Java full-text search framework. More precisely, it is a full-text retrieval engine architecture that provides a complete query engine and indexing engine, along with a partial text analysis engine. Lucene gives developers a fairly complete toolkit that makes it easy to implement powerful full-text search. Several of the search engine frameworks below are also built on Lucene.<\/p>\n<p>Official website: <a target=\"_blank\" rel=\"noopener\" href=\"http:\/\/lucene.apache.org\/\">http:\/\/lucene.apache.org\/<\/a><\/p>\n<h2>Open-Source Java Search Engine Nutch<\/h2>\n<p>Nutch 
is an open-source search engine implemented in Java. It provides all the tools needed to run your own search engine, including full-text search and a web crawler.<\/p>\n<p>With Nutch, you can:<\/p>\n<ul>\n<li>Fetch several billion web pages per month<\/li>\n<li>Maintain an index of those pages<\/li>\n<li>Search that index thousands of times per second<\/li>\n<li>Provide high-quality search results<\/li>\n<li>Operate at minimal cost<\/li>\n<\/ul>\n<p>Official website: <a target=\"_blank\" rel=\"noopener\" href=\"http:\/\/nutch.apache.org\/\">http:\/\/nutch.apache.org\/<\/a><\/p>\n<h2>Distributed Search Engine ElasticSearch<\/h2>\n<p>ElasticSearch is a distributed search engine built on the Lucene framework, and one of the few search engines that index data as JSON. ElasticSearch is particularly well suited to cloud computing platforms.<\/p>\n<p>Official website: <a target=\"_blank\" rel=\"noopener\" href=\"http:\/\/www.elasticsearch.org\/\">http:\/\/www.elasticsearch.org\/<\/a><\/p>\n<h2>Real-Time Distributed Search Engine Solandra<\/h2>\n<p>Solandra is a real-time distributed search engine built on Apache Solr and Apache Cassandra.<\/p>\n<p>Its features are as follows:<\/p>\n<ul>\n<li>Supports most of Solr's default features (search, faceting, 
highlights)<\/li>\n<li>Data replication, sharding, caching, and compression are all handled by Cassandra<\/li>\n<li>Multi-master (any node can serve reads and writes)<\/li>\n<li>Real-time: data can be read as soon as a write completes<\/li>\n<li>Easily add new SolrCores across the cluster without a restart<\/li>\n<\/ul>\n<p>Official website: <a target=\"_blank\" rel=\"noopener\" href=\"https:\/\/github.com\/tjake\/Solandra\">https:\/\/github.com\/tjake\/Solandra<\/a><\/p>\n<h2>IndexTank<\/h2>\n<p>IndexTank is a Java-based indexing and real-time full-text search engine implementation. Its main features are:<\/p>\n<ul>\n<li>Index updates take effect in real time<\/li>\n<li>Geolocation search<\/li>\n<li>Supports multiple client languages: Ruby, Rails, Python, Java, PHP, .NET &amp; more<\/li>\n<li>Flexible control over sorting and scoring<\/li>\n<li>Autocomplete<\/li>\n<li>Faceted search<\/li>\n<li>Match highlighting<\/li>\n<li>Scales to very large data sets (scalable from a personal blog to hundreds of millions of documents 
)<\/li>\n<li>Supports dynamic data<\/li>\n<\/ul>\n<p>Official website: <a target=\"_blank\" rel=\"noopener\" href=\"https:\/\/github.com\/linkedin\/indextank-engine\">https:\/\/github.com\/linkedin\/indextank-engine<\/a><\/p>\n<h2>Search Engine Compass<\/h2>\n<p>Compass is a powerful, transactional, high-performance object\/search engine mapping (OSEM) framework and Java persistence framework. Compass includes:<\/p>\n<ul>\n<li>A search engine abstraction layer (using the Lucene search engine)<\/li>\n<li>OSEM (Object\/Search Engine Mapping) support<\/li>\n<li>Transaction management<\/li>\n<li>A simple, Google-like keyword query language<\/li>\n<li>An extensible, modular framework<\/li>\n<li>A simple API<\/li>\n<\/ul>\n<p>Official website: <a target=\"_blank\" rel=\"noopener\" href=\"http:\/\/www.compass-project.org\/\">http:\/\/www.compass-project.org\/<\/a><\/p>\n<h2>Java Full-Text Search Server Solr<\/h2>\n<p>Solr is also implemented in Java and built on Lucene. Its main features include efficient and flexible caching, vertical search, and highlighting of search results. Notably, Solr also provides a handy web interface for managing indexed data.<\/p>\n<p>Official website: <a target=\"_blank\" rel=\"noopener\" href=\"http:\/\/lucene.apache.org\/solr\/\">http:\/\/lucene.apache.org\/solr\/<\/a><\/p>\n<h2>Lucene Image Search 
LIRE<\/h2>\n<p>LIRE is a Java-based image search framework whose core is also built on Lucene. Its index can be used to build a content-based image retrieval (CBIR) system that searches for similar images.<\/p>\n<p>Official website: <a target=\"_blank\" rel=\"noopener\" href=\"http:\/\/www.semanticmetadata.net\/lire\/\">http:\/\/www.semanticmetadata.net\/lire\/<\/a><\/p>\n<h2>Full-Text Search Engine Egothor<\/h2>\n<p>Egothor is an open-source, efficient full-text search engine written in Java. Thanks to Java's cross-platform nature, Egothor can be used by applications in any environment: it can be configured as a standalone search engine or embedded in your own application for full-text retrieval.<\/p>\n<p>Official website: <a target=\"_blank\" rel=\"noopener\" href=\"http:\/\/www.egothor.org\/cms\/\">http:\/\/www.egothor.org\/cms\/<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Java Full-Text Search Engine Framework Lucene Without a doubt, Lucene is currently the most popular Java full-text search framework. More precisely, 
[&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[97],"tags":[564,566,180,563,559,561,562,565],"class_list":["post-2222","post","type-post","status-publish","format-standard","hentry","category-tools-skills","tag-compass","tag-egothor","tag-elasticsearch","tag-indextank","tag-lucene","tag-nutch","tag-solandra","tag-solr"],"_links":{"self":[{"href":"https:\/\/www.appblog.cn\/index.php\/wp-json\/wp\/v2\/posts\/2222","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.appblog.cn\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.appblog.cn\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.appblog.cn\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.appblog.cn\/index.php\/wp-json\/wp\/v2\/comments?post=2222"}],"version-history":[{"count":0,"href":"https:\/\/www.appblog.cn\/index.php\/wp-json\/wp\/v2\/posts\/2222\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.appblog.cn\/index.php\/wp-json\/wp\/v2\/media?parent=2222"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.appblog.cn\/index.php\/wp-json\/wp\/v2\/categories?post=2222"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.appblog.cn\/index.php\/wp-json\/wp\/v2\/tags?post=2222"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}