I ended up using the Sickle package, which I found to be much better documented and easier to use. This code gets all the sets, and then retrieves each record from each set. This seems like the best solution given that there are more than 30000 records to...
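A minimal sketch of the set-by-set approach, assuming a client that behaves like `sickle.Sickle` (its `ListSets()` and `ListRecords()` calls). The names `harvest_by_set` and `_StubClient` are hypothetical and not part of Sickle; the stub only stands in for a real endpoint so the example runs offline.

```python
from types import SimpleNamespace


def harvest_by_set(client, metadata_prefix="oai_dc"):
    """Yield (setSpec, record) pairs, visiting each record only once.

    `client` is expected to behave like sickle.Sickle: ListSets() yields
    objects with a .setSpec attribute, and ListRecords() accepts OAI-PMH
    arguments as keyword arguments.
    """
    seen = set()  # an item may belong to several sets, so track identifiers
    for oai_set in client.ListSets():
        records = client.ListRecords(
            metadataPrefix=metadata_prefix, set=oai_set.setSpec
        )
        for record in records:
            identifier = record.header.identifier
            if identifier not in seen:
                seen.add(identifier)
                yield oai_set.setSpec, record


class _StubClient:
    """Hypothetical offline stand-in for sickle.Sickle (illustration only)."""

    _sets = {"a": ["oai:x:1", "oai:x:2"], "b": ["oai:x:2", "oai:x:3"]}

    def ListSets(self):
        return [SimpleNamespace(setSpec=spec) for spec in self._sets]

    def ListRecords(self, metadataPrefix, set):
        return [
            SimpleNamespace(header=SimpleNamespace(identifier=i))
            for i in self._sets[set]
        ]


harvested = [
    (spec, rec.header.identifier)
    for spec, rec in harvest_by_set(_StubClient())
]
print(harvested)
```

With a real repository you would pass `Sickle(your_endpoint_url)` instead of the stub. Two caveats: items that belong to no set are never reached this way, and a set with no records may raise `sickle.oaiexceptions.NoRecordsMatch`, which you would probably want to catch per set.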
I encountered this error in the past. Looking at my oai.cfg file, I used localhost for some settings and my public URL for others.

    solr.url=http://localhost/solr/oai
    # OAI persistent identifier prefix.
    # Format - oai:PREFIX:HANDLE
    identifier.prefix = repository.library.georgetown.edu
    # Base url for bitstreams
    bitstream.baseUrl = https://repository.library.georgetown.edu

If you need to make...
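To see at a glance which settings point at localhost and which at the public host, something like the following may help. It is only a sketch: `hosts_by_setting` is a hypothetical helper, it assumes simple `key = value` lines with `#` comments, and it looks only at values that contain a full URL. It does not decide which host is the right one for your setup.

```python
from urllib.parse import urlparse

# Sample taken from the settings quoted above.
SAMPLE_CFG = """\
solr.url=http://localhost/solr/oai
# OAI persistent identifier prefix.
# Format - oai:PREFIX:HANDLE
identifier.prefix = repository.library.georgetown.edu
# Base url for bitstreams
bitstream.baseUrl = https://repository.library.georgetown.edu
"""


def hosts_by_setting(cfg_text):
    """Map each URL-valued setting to the hostname it points at."""
    hosts = {}
    for raw_line in cfg_text.splitlines():
        line = raw_line.strip()
        # Skip blanks, comments, and anything that is not key = value.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = (part.strip() for part in line.split("=", 1))
        if "://" in value:  # only full URLs carry a host we can compare
            hosts[key] = urlparse(value).hostname
    return hosts


hosts = hosts_by_setting(SAMPLE_CFG)
for key, host in hosts.items():
    print(f"{key}: {host}")
```

For a real file you would pass the text of your own oai.cfg, for example via `open(path).read()`.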
Be sure to read the short and sweet Tutorial. For harvesting an entire OAI-PMH repository, you do not need to iterate over sets. Here is the complete code:

    from sickle import Sickle

    sickle = Sickle('http://www.duo.uio.no/oai/request')
    # With no set argument, ListRecords covers the whole repository.
    recs = sickle.ListRecords(metadataPrefix="oai_dc")
    for r in recs:
        print(r)

If for some reason you really...
The reason your attempt doesn't work is that the source XML nodes are in a namespace. You must declare this namespace in your stylesheet, assign it a prefix, and use that prefix when addressing the nodes:

    <xsl:stylesheet version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:oai="http://www.openarchives.org/OAI/2.0/">
    <xsl:output method="xml" version="1.0" encoding="utf-8" indent="yes"/>
    <xsl:template match="/">
    <FMPXMLRESULT xmlns="http://www.filemaker.com/fmpxmlresult">...
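The same namespace rule shows up outside XSLT. As an illustration in Python's `xml.etree.ElementTree` (not the stylesheet itself), an unprefixed path finds nothing in an OAI-PMH response, while a path using a prefix bound to the OAI namespace does. The sample document and its identifiers are made up for the example.

```python
import xml.etree.ElementTree as ET

# Made-up OAI-PMH response; every element sits in the default OAI namespace.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record><header><identifier>oai:example.org:1</identifier></header></record>
    <record><header><identifier>oai:example.org:2</identifier></header></record>
  </ListRecords>
</OAI-PMH>"""

# The prefix name is arbitrary; only the namespace URI has to match.
NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}

root = ET.fromstring(SAMPLE)

# No prefix: looks for <record> in no namespace, so nothing matches.
unprefixed = root.findall(".//record")

# With the prefix: matches the namespaced <record> elements.
prefixed = root.findall(".//oai:record", NS)
identifiers = [
    rec.findtext("oai:header/oai:identifier", namespaces=NS)
    for rec in prefixed
]

print(len(unprefixed), len(prefixed), identifiers)
```

In the stylesheet, the `xmlns:oai` declaration plays the role of `NS` here, and paths such as `oai:record` correspond to the prefixed lookups.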