Im using Amazon Web Services with Tomcat to deploy a Java application.
The application consists of a a Lucene index of artist data and a website that allows a user to search for a musical artist (e.g madonna, beatles) it will then return information about that artist generated from the index in the form of a html page. The index doesn't change and hence the page generated for a particular artist never changes. I don't want to pre-calculate the pages because that would take a long time (over a million pages) and require alot of space, but I do want the pages to be cached so that:
- A search for the artist can return existing page instead of having to recreate it.
- I want pages get crawled and indexed by Google so they can be returned in Google search.
How do I go about this (I have plenty of Java experience but not much website experience)
When I say the index never changes I mean that the deployed application uses the same index, and that index is never modified. About once a month a new index will be created and the application redeloyed with the new index.