Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Table of Contents

Abstract

This concept page defines some rough ideas that could be used to either refine search in the ressources module using SolR or make a more generic faceted search using only Solr's faceted search.

Choice 1: Enhancing the resources module

Goal

The goal is to do a refined search inside the selected items, so the existing resources module will be enhanced with a customized search to search inside selected content.

Benefits

Search keywords inside the resources, assist the user with a search inside the resources.

Implementation possibilities

Create an new search box which talks to solr and filters on the url it is in as prefix, call itself to display the results or show another page outside the resources module page.

Drawbacks/Difficulties

No use of solr's faceted search will be made, the implementation will not be that generic.

Choice 2: Using SolR's faceted search

Goal

The goal is to use the faceted search offered by SolR to implement a generic faceted search on all website content.

Benefits

A generic approach with different visualizations are possible, faceting can be done on all the content, not only on resources.

Implementation possibilities

The URL splitted in paths could define a nice already existing categorization, for instance a faceted search today on teh corp website with a url categorization gives teh following results.

...

Filtering out the irrelevant tags could give a nice generic auto categorization. Of course this does not take in consideration the user defined categories through the category module.

Drawbacks/Difficulties

The Solr indexing is URL based, so there need to be way to either add those user selected categories to the URL which would be difficult I guess, or catch the categories from the URL and send them to the solR index.

...

Info

This will work only if there are no multiple categorizations present in the page on different subcontents !

 

 

 

Choice 3: Tagging content inside the page ( meta and micro tags )

Goal

The goal is to enhance the categorization module to tag content inside the page, either for the whole page (meta tag in header or in the div), this way they can be picked up by an external parser or search engine and offer SEO enhancement and in house faceting.

Benefits

Standardized categorization and content tagging, easily exploitable by standard parser tools and search engines.

Micro tags could as well be used to tell the custom magnolia extractor not to index certain content, for a complete page, "robots.txt" can be used

Implementation possibilities

Drawbacks/Difficulties

Multiple categorizations by page