Number Of Datastreams Per Object vs Ingest Time

Setup

  • 10,000 digital objects generated
  • each object gets an additional datastream, so that n=1 has 1 datastream and n=10,000 has 10,000 datastreams
  • datastreams are inline (CONTROL_GROUP="X"), active and versionable
  • datastreams do not have content except one (mandatory) element
The following xml snippet shows one generated datastream
<foxml:datastream CONTROL_GROUP="X" ID="escidoc.2" STATE="A" VERSIONABLE="true">
 <foxml:datastreamVersion ID="escidoc.2dsid" LABEL="my label" MIMETYPE="text/xml">
  <foxml:xmlContent>
    <dummy/>
  </foxml:xmlContent>
 </foxml:datastreamVersion>
</foxml:datastream>

The ingest finished without problems, showing that it is possible to ingest objects with a number of inline d datastreams as high as 10000. Another observation is the increase in ingest time which seems to become superlinear over time. The images below show the outcome of the test run.

Maximum Number Of Versions

TODO

Add new attachment

In order to upload a new attachment to this page, please use the following box to find the file, then click on “Upload”.

List of attachments

Kind Attachment Name Size Version Date Modified Author Change note
png
fedrep3_mass_datastreams_1200x... 36.33 kB 1 Tue May 20 09:53:14 CEST 2008 KST
png
fedrep3_mass_datastreams_600x4... 9.6 kB 1 Tue May 20 09:53:12 CEST 2008 KST
png
fedrep3_mass_datastreams_avg_1... 12.896 kB 1 Tue May 20 09:53:22 CEST 2008 KST
png
fedrep3_mass_datastreams_avg_6... 5.095 kB 1 Tue May 20 09:53:20 CEST 2008 KST
« This page (revision-1) was last changed on 21-May-2008 12:23 by unknown [RSS]