<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-3808045742964712463</id><updated>2011-09-17T12:36:20.809+01:00</updated><category term='preservation'/><category term='migration'/><category term='JP2K'/><category term='JPEG2000'/><title type='text'>JPEG 2000 at the Wellcome Library</title><subtitle type='html'></subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>27</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-6968019383998381706</id><published>2011-09-09T09:11:00.000+01:00</published><updated>2011-09-09T09:11:00.739+01:00</updated><title type='text'>Simplifying our JPEG2000 conversion workflow</title><content type='html'>Over the summer, we have been working to streamline our JPEG 2000 conversion workflow. With the help of software developers from &lt;a href="http://www.genisys.co.uk/index.html"&gt;Genisys &lt;/a&gt;- one of the Trust’s strategic IT development and support partners - we have put the LuraWave command line interface to use in automating batch conversion.&lt;br /&gt;&lt;br /&gt;Up to now we have been using the native GUI interface that comes with the LuraWave software, manually entering parameters and initiating the conversion process for each batch of images. This was useful for us as we settled into a large-scale digitisation workflow incorporating RAW - TIFF - JP2 conversion, cleared our backlog and established our compression testing methodology (as described in previous posts on this blog). With no relevant in-house programming expertise, the GUI was essential during these early stages.&amp;nbsp; &lt;br /&gt;&lt;br /&gt;Now that we have a firm idea of how we want to use LuraWave, where it fits into the overall workflow, and what kind of throughput we need on a day-to-day basis, it was time to set up an automated solution. &lt;br /&gt;&lt;br /&gt;The Wellcome Trust operates in an (almost) entirely Windows environment, so we commissioned the Genisys software engineers to code a .NET wrapper script running as an executable.&amp;nbsp; The wrapper script invokes LuraWave’s command line conversion to allow us to convert images with no manual intervention. An XML configuration file that contains the following information is used to control how the wrapper script invokes LuraWave:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;"Inbox" directory (files ready for conversion)&lt;/li&gt;&lt;li&gt;Temporary directory (files copied before conversion)&lt;/li&gt;&lt;li&gt;"Outbox" directory (converted files)&lt;/li&gt;&lt;li&gt;LuraWave command line&lt;/li&gt;&lt;li&gt;Error directory&lt;/li&gt;&lt;li&gt;List of any files to exclude from conversion&lt;/li&gt;&lt;/ul&gt;LuraWave retains the original folder structure, so the "Inbox" and "Outbox" is the top level directory, with the original folder hierarchy maintained throughout the conversion process. &lt;br /&gt;&lt;br /&gt;Polling of the specified input folder is handled with Windows Scheduler, which can be run on a PC or on a server (we run it on a virtual server). Every 5 minutes Windows Scheduler prompts the script to check for TIFFs in the "Inbox".&amp;nbsp; Lurawave is then invoked, converting the TIFFs to JP2s that are copied out to the “Outbox”.&amp;nbsp; We’ve got some really good error handling in place so if one rogue file can’t be converted the rest of the files still get converted – essential when converting big volumes, we don’t want the first file failing and halting an overnight run of thousands of files.&lt;br /&gt;&lt;br /&gt;Windows Scheduler does not parallel process, so folders are queued for conversion. With speeds of around 30Gb (at least 1,200 TIFFs) per hour, this is quick enough for our needs. &lt;br /&gt;&lt;br /&gt;This implementation means that a single LuraWave license can be used for any number of input streams, and with the facility to "call" multiple definitions; it can also convert images to multiple JPEG 2000 profiles (we currently have a lossless profile and a lossy profile). &lt;br /&gt;&lt;br /&gt;&lt;i&gt;With thanks to Alastair Reid, Wellcome Trust IT Account Manager, for providing this information and reviewing this post.&lt;/i&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-6968019383998381706?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/6968019383998381706/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=6968019383998381706&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/6968019383998381706'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/6968019383998381706'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2011/09/simplifying-our-jpeg2000-conversion.html' title='Simplifying our JPEG2000 conversion workflow'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-6561715087637009714</id><published>2011-06-21T18:12:00.010+01:00</published><updated>2011-07-20T09:16:40.079+01:00</updated><title type='text'>Thoughts on the 2011 JP2 Summit</title><content type='html'>I attended the JP2 Summit in Washington D.C. in May (initiated and organised by Robert Buckley and Steve Puglia and hosted by the Library of Congress) representing both the &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_0"&gt;Wellcome&lt;/span&gt; Library and the JP2K-UK Working Group. I found this event an interesting counterpart to the &lt;a href="http://jpeg2000wellcomelibrary.blogspot.com/search?q=highlights"&gt;&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_1"&gt;JPEG&lt;/span&gt;2000 Seminar&lt;/a&gt; we held here at the &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_2"&gt;Wellcome&lt;/span&gt; Trust last year.&lt;br /&gt;&lt;br /&gt;There were around 90 people at the Summit, most from the D.C. area and eastern seaboard cultural institutions such as the &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_3"&gt;LoC&lt;/span&gt;; National Archives; Smithsonian libraries and archives; a range of university libraries including Yale, Harvard, U. of Virginia, &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_4"&gt;UConn&lt;/span&gt;; NARA; and many others. The level of experience in digital imaging and preservation was generally quite high, while the understanding of &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_5"&gt;JPEG&lt;/span&gt;2000 ranged from very little to highly informed. Nearly a mirror audience to the &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_6"&gt;Wellcome&lt;/span&gt; Trust event, although perhaps with fewer privately funded organisations represented (although there were some, including Google).&lt;br /&gt;&lt;br /&gt;The day began with a tutorial by Robert Buckley, and although I had heard much of this in previous presentations, or through reading up on JP2, I always find it hard to keep the details fresh in my mind. So it was useful to get a refresher, and it set the stage well for people who had little knowledge of the technical issues and background to the format.&lt;br /&gt;&lt;br /&gt;After the tutorial, there was a series of presentations, all of which are listed on the &lt;a href="http://www.digitizationguidelines.gov/resources/jpeg2000.html"&gt;&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_7"&gt;JPEG&lt;/span&gt;2000 page&lt;/a&gt; of the &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_8"&gt;FADGI&lt;/span&gt; website. I won't go into the details here (and you can read more on Steve &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_9"&gt;Puglia's&lt;/span&gt; &lt;a href="http://blogs.loc.gov/digitalpreservation/2011/06/a-fine-view-at-the-summit-of-jp2/"&gt;blog post&lt;/a&gt;), but we heard about a range of practical issues around use of JP2 for newspaper digitisation, digital video, special collections and Google books; technical developments around implementing JP2 as part of a &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_10"&gt;workflow&lt;/span&gt; including quality assurance and issues of long-term preservation; and the results of a survey of use and attitudes toward JP2 in libraries and archives.&lt;br /&gt;&lt;br /&gt;In the library and archive community JP2 is being adopted mainly for mass digitisation with storage costs being the primary driver - there is no denying that. What was clear here - as with the presentations given last year - was that while JP2 is not yet the most practical solution in terms of &lt;span class="blsp-spelling-corrected" id="SPELLING_ERROR_11"&gt;usability&lt;/span&gt;, it is becoming more and more widely accepted for its flexibility and robustness as well as for its space-saving intelligent compression. With increasing knowledge of the format practitioners are now coming to see JP2 in the context of these other important features, and investigating - even demanding - ways to use these other features more easily.&lt;br /&gt;&lt;br /&gt;Of course, not everyone is 100% convinced that JP2 can meet the needs of digital archiving, or digital image delivery. Many concerns seem to have been appeased by the presentations and tutorial - simply by finding out how many people are using the format, and how much value they get from it. There are still barriers to people taking up JP2 more enthusiastically - mainly around the lack of adoption by digital cameras and browsers, loss of information in &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_12"&gt;lossy&lt;/span&gt; compression, risk that there still isn't a wide enough take-up in the community to maintain the currency of the format in the longer term, and the small range of tools for implementing the format that simply can't meet their needs.&lt;br /&gt;&lt;br /&gt;The second day of the Summit finished off with a small-group discussion session around JP2 implementation. For me, the most interesting part of this discussion was around community building.&lt;br /&gt;&lt;br /&gt;While we may never see digital cameras &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_13"&gt;natively&lt;/span&gt; producing JP2s, for example, some barriers can be broken down by simply sharing. Information on and results of testing, tools and ways to use them, &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_14"&gt;workflow&lt;/span&gt; advice, and preservation technologies are all important and can easily be shared. Use of JP2 doesn't always boil down to technical reassessment however. There is also revisiting certain aspects of digital preservation strategy such as defining significant properties/data, predicting migration scenarios and what that really entails, determining what the &lt;span style="FONT-STYLE: italic"&gt;use&lt;/span&gt; of the digital content really is. It is also recognising emotional responses to preservation risks and the fact that these decisions have a long-term effect, shaping the legacy of entire collections. The leap to JP2 is best done in collaboration, and moral support should not be discounted!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-6561715087637009714?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/6561715087637009714/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=6561715087637009714&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/6561715087637009714'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/6561715087637009714'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2011/06/thoughts-on-2011-jp2-summit.html' title='Thoughts on the 2011 JP2 Summit'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-7038205348264571761</id><published>2011-06-15T15:41:00.002+01:00</published><updated>2011-06-15T15:45:25.228+01:00</updated><title type='text'>The JP2K-UK wiki has moved</title><content type='html'>The wiki created as part of the JP2K-UK working group has been moved to a &lt;a href="http://wiki.opf-labs.org/display/JP2/Home"&gt;dedicated space&lt;/a&gt; on the Open Planets wiki. The content has now been transferred and is in the process of being updated and added to. We welcome contributions - all you have to do is log into the&lt;a href="http://wiki.opf-labs.org/display/KB/Home"&gt; OPF wiki&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-7038205348264571761?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/7038205348264571761/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=7038205348264571761&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/7038205348264571761'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/7038205348264571761'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2011/06/jp2k-uk-wiki-has-moved.html' title='The JP2K-UK wiki has moved'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-1297667260010609995</id><published>2011-05-27T09:33:00.000+01:00</published><updated>2011-05-27T15:53:18.347+01:00</updated><title type='text'>ICC profiles and LuraWave</title><content type='html'>Johan van &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_0"&gt;der&lt;/span&gt; &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_1"&gt;Knijff's&lt;/span&gt; long-awaited D-Lib paper &lt;a href="http://www.dlib.org/dlib/may11/vanderknijff/05vanderknijff.html"&gt;&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_2"&gt;JPEG&lt;/span&gt; 2000 for long term preservation: JP2 as a preservation format&lt;/a&gt;, has now come out. In this paper he mentions the various ways &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_3"&gt;LuraWave&lt;/span&gt; has handled colour profile information, and I thought it was a good time to elaborate some on the developments we have commissioned from &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_4"&gt;Luratech&lt;/span&gt; regarding this issue.&lt;br /&gt;&lt;br /&gt;As Johan mentions in the paper, when we &lt;a href="http://jpeg2000wellcomelibrary.blogspot.com/2010/07/finding-jpeg-2000-conversion-tool.html"&gt;started using &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_5"&gt;LuraWave&lt;/span&gt;&lt;/a&gt; and carrying out &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_6"&gt;JHOVE&lt;/span&gt; testing to determine whether the files were compliant with the standard, we found that where an ICC display profile was included in the TIFF (and this was virtual standard across our image set) &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_7"&gt;LuraWave&lt;/span&gt; automatically encoded the file as &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_8"&gt;JPX&lt;/span&gt; in a JP2 wrapper. This ensured compliance with the standard, but we were not happy with using &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_9"&gt;JPX&lt;/span&gt;. So we asked &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_10"&gt;Luratech&lt;/span&gt; to modify &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_11"&gt;LuraWave&lt;/span&gt; to include an additional command that allowed us to tell the application to ignore the ICC profile completely. This meant that we got a 100% JP2 file, but the colour profile information was then stripped out.&lt;br /&gt;&lt;br /&gt;We wanted to include a colour profile in our digital image files. This prevents ambiguity when decoding the images in an image editor or image viewer. We were left with only one option - convert everything to &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_12"&gt;sRGB&lt;/span&gt; and allow &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_13"&gt;LuraWave&lt;/span&gt; to include the numerical value of &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_14"&gt;sRGB&lt;/span&gt; in the file, which is allowed by the standard. Adobe &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_15"&gt;RBG&lt;/span&gt; 1998, as Johan explains in detail in his article, is allowed only as an &lt;span style="font-style: italic;"&gt;input &lt;/span&gt;profile, and our images did not include an input profile (and we didn't know how we could go about adding an input profile to our images).&lt;br /&gt;&lt;br /&gt;We knew that it wouldn't matter to us, to the user, or to the decoding programme, how the profile was labelled - as long as it was there. It mattered only to the standard. So we asked &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_16"&gt;Luratech&lt;/span&gt; to modify &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_17"&gt;LuraWave&lt;/span&gt; yet again in order to read the display profile in our &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_18"&gt;TIFFs&lt;/span&gt; and embed it into the JP2 file as an input profile. It is not an input profile. But we were limited by the standard, and this was our best option within those limitations to ensure we could include colour information without having to limit ourselves to &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_19"&gt;sRGB&lt;/span&gt; - and without having to add in a &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_20"&gt;workflow&lt;/span&gt; step to convert all our legacy images to &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_21"&gt;sRGB&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;This is the version of &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_22"&gt;LuraWave&lt;/span&gt; that we currently use (2.1.22.10 - which includes other enhancements around improving performance, as reported in an &lt;a href="http://jpeg2000wellcomelibrary.blogspot.com/2011/01/tiff-to-jpeg-2000-backlog-losslessness.html"&gt;earlier blog post&lt;/a&gt;). However - since Johan has succeeded in raising awareness of the deficient colour space provision in the standard, leading to &lt;a href="http://jpeg2000wellcomelibrary.blogspot.com/2011/04/guest-post-color-in-jp2.html"&gt;agreement in the &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_23"&gt;JPEG&lt;/span&gt; Committee&lt;/a&gt; to change the standard to accommodate real use scenarios such as our own, we can envisage requesting further changes to the &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_24"&gt;LuraWave&lt;/span&gt; command tool once this is finalised.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-1297667260010609995?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/1297667260010609995/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=1297667260010609995&amp;isPopup=true' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/1297667260010609995'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/1297667260010609995'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2011/05/icc-profiles-and-lurawave.html' title='ICC profiles and LuraWave'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-4734262021045366332</id><published>2011-04-28T09:07:00.006+01:00</published><updated>2011-04-28T09:50:40.635+01:00</updated><title type='text'>Guest post: Color in JP2</title><content type='html'>&lt;span style="font-style: italic;"&gt;Rob Buckley, colour imaging expert and author of &lt;/span&gt;&lt;a href="http://library.wellcome.ac.uk/assets/wtx056572.pdf"&gt;&lt;span&gt;JPEG 2000 as a Preservation and Access Format for the Wellcome Library&lt;/span&gt;&lt;/a&gt;&lt;span style="font-style: italic;"&gt;, writes about the implementation of colour space metadata in the JP2 format and planned changes to the specification to better accommodate this information.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;When I talk about JPEG 2000, I point out that most if not all still image applications that use JPEG 2000, especially in the cultural heritage community, can be satisfied with the JP2 file format. JP2 is the basic file format defined in &lt;a href="http://www.jpeg.org/jpeg2000/j2kpart1.html"&gt;Part 1&lt;/a&gt; of the JPEG 2000 standard, along with the core decoder. &lt;a href="http://www.jpeg.org/jpeg2000/j2kpart2.html"&gt;Part 2&lt;/a&gt; of the standard defines extended versions of both the file format and decoder, offering features aimed at specialized or advanced applications.&lt;br /&gt;&lt;br /&gt;One point of confusion about the use of JP2 has had to do with its support for &lt;span style="font-weight: bold;"&gt;color spaces&lt;/span&gt;. When we were developing JP2 in the late 1990’s (JPEG 2000 was intended to come out in 2000), the application that most influenced the design was digital photography—JP2 was expected to be the next digital camera format. So support for sRGB was built in, along with support for the YCC and grayscale versions of sRGB. Other RGB color spaces used for image capture would be supported by using ICC input profiles, leaving aside display and output profiles. However, not all ICC input profiles were allowed: support was restricted to the ones needed for grayscale and RGB image data. Not supported and considered too complex for applications without a full color management engine was the input profile type that used a full multi-dimensional lookup-table. So users had the choice of specifying color in a JP2 file by name as sRGB (or sYCC or sGray) or via a simple ICC input profile.&lt;br /&gt;&lt;br /&gt;After the release of the JPEG 2000 standard, two things happened. First digital cameras kept exporting the JPEG Baseline format; when they added a new export format, it was Raw and not JP2. The drive was toward more creative control rather than better compression when what they had was good enough.&lt;br /&gt;&lt;br /&gt;The second thing was that most people ended up using &lt;span style="font-weight: bold;"&gt;ICC &lt;span style="font-style: italic;"&gt;display &lt;/span&gt;profiles&lt;/span&gt; for RGB spaces rather than input profiles. A small thing you’d think, especially when the only difference between the display profiles they used and the input profiles supported by JP2 was the profile class value in the profile’s header: except for that, the data content of the two profile types is identical for RGB color spaces. As a result, I could take a JP2 file containing an RGB display profile (which technically makes the JP2 file illegal) change the profile class from display to input (by changing four bytes in the profile header and leaving everything else the same) and produce a legal JP2 file. It turns out that most readers ignore this value anyway and read the file fine either way. Using the extended file format was no help because it only extended color support to all types of input profiles, plus some other named and vendor-specified color spaces.&lt;br /&gt;&lt;br /&gt;This confusion needed to be addressed as more and more institutions are using JP2 as a long-term preservation format, where predictability and clarity are prized. The solution is straightforward: amend the JP2 file format specification, aligning it with current practice so that it supports ICC display profiles as well as the set of input profiles it supports now.&lt;br /&gt;&lt;br /&gt;And this is what is happening. Richard Clark and I led an activity that culminated in the JPEG 2000 committee approving a new activity to amend JP2 when it met this past February in Tokyo. This means that JP2 will support a wide range of RGB color spaces, which was the original intent, via both ICC input and display profiles. Since the JP2 spec was first issued, the ICC spec has undergone a major revision from V2 to V4 and been issued as an ISO standard. While this revision hardly affects the profiles used for RGB color spaces, it will also be addressed as part of the amendment. (The amendment will also address the ambiguity in the JP2 definition of resolution that Johan van der Knijff has &lt;a href="http://jpeg2000wellcomelibrary.blogspot.com/2010/12/guest-post-ensuring-suitability-of-jpeg.html"&gt;brought up&lt;/a&gt; on this blog.)&lt;br /&gt;&lt;br /&gt;The final outcome of all this will be a JP2 file format standard that aligns with current practice; supports RGB spaces such as Adobe RGB 1998, ProPhoto RGB and eci RGB v2; and provides a smooth migration path from TIFF masters as JP2 increasingly becomes used as an image preservation format.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-4734262021045366332?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/4734262021045366332/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=4734262021045366332&amp;isPopup=true' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/4734262021045366332'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/4734262021045366332'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2011/04/guest-post-color-in-jp2.html' title='Guest post: Color in JP2'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-2552148679871955249</id><published>2011-01-28T09:45:00.002Z</published><updated>2011-01-28T09:53:25.419Z</updated><title type='text'>TIFF to JPEG 2000 backlog, losslessness, and a perplexing speed issue</title><content type='html'>In October 2010 we initiated our "TIFF to JPEG 2000 backlog project", an endeavor to convert all the legacy images that make up our current image archive (&lt;a href="http://images.wellcome.ac.uk/"&gt;Wellcome Images&lt;/a&gt;), as well as around 120,000 images that had been created during the Archives digitisation &lt;a href="http://library.wellcome.ac.uk/doc_WTX057852.html"&gt;project&lt;/a&gt;. Over 450,000 images comprise the backlog, saved in a multitude of folders, on different servers on our Pillar SAN storage system. Converting the Wellcome Images TIFFs to &lt;span style="FONT-WEIGHT: bold"&gt;lossless &lt;/span&gt;JPEG 2000 will save us around 12 Tb of storage space alone.&lt;br /&gt;&lt;br /&gt;Why lossless, you ask? We have indeed expounded on the merits of lossy compression for large image sets created as a result of digitisation projects. But there is a significant difference with regards to the backlog project. While digitisation projects are usually carried out on collections of material that have fairly similar physical formats (modern printed books, paper documents, Arabic manuscripts, etc.), lending themselves to a &lt;a href="http://jpeg2000wellcomelibrary.blogspot.com/2010/08/as-result-of-our-decision-to-go-lossy.html"&gt;generalised approach&lt;/a&gt; to compression determined via testing, this backlog project has no overall commonality (other than that they are all TIFFs of one flavour or another). Wellcome Images is populated one image at a time, or by small sets of images, including born digital photography and represent a cross-section of hundreds of different content types. There was no feasible way to group these images into sets that could be assessed for compression tolerance. The decision was made, therefore, to convert the entire Wellcome Images backlog to lossless JP2 files, thus removing any doubt whether the compression levels were appropriate.&lt;br /&gt;&lt;br /&gt;During the initial stages of this project, we tested our installation of the LuraWave conversion tool (v.2.1.21.10) with high volumes of images stored on our network storage (as all the archived TIFFs are). What we found surprised us - instead of 20 min or so we expected for a batch of around 600 25Mb images, it was taking all night (around 6 hours). Was it a bandwidth issue? With the support of our IT team we carried out tests over the 1Gb network area. It was still unacceptably slow, showing that bandwith was not the issue. We moved the same batch of images onto the local hard drive of the machine that LuraWave was installed on, and confirmed that, yes, LuraWave can convert those images in around 20 min when they are colocated.&lt;br /&gt;&lt;br /&gt;We turned to our suppliers, LuraTech, who quickly ferreted out the problem. LuraWave was programmed to convert images in parallel, to speed up the process, but it also buffers images in parallel. This buffering process, when carried out across our 100Mb network cable, slowed down considerably due to the parallel running. LuraTech modified the programme to cache each image onto the local disk first, individually, before then buffering and converting in parallel as usual. This brought the overall time down by 80%. The version we are currently using is 2.1.22.10.&lt;br /&gt;&lt;br /&gt;In practice our approach has been tailored to suit individual sets of images within our backlog. A balance has to be struck between ease of use and the practicalities of applying multiple processing stages to files over a 100Mb network. Some image sets are copied locally to external hard drives, taking advantage of the speed gains this gives, whereas others that are more straightforward can be processed directly over the network using the much improved processing speeds. The combined effeciencies made converting our entire backlog feasible within the timeframe we had to spend on it.&lt;br /&gt;&lt;br /&gt;We are now about a third of the way through the conversion backlog, and on track to become virtually TIFF-free by May 2011. What I haven't mentioned is the colour profile embedding issues that cropped up, the legacy colour space problems, and the work LuraTech did in addressing these issues - the topic of a future blog post.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-2552148679871955249?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/2552148679871955249/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=2552148679871955249&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/2552148679871955249'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/2552148679871955249'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2011/01/tiff-to-jpeg-2000-backlog-losslessness.html' title='TIFF to JPEG 2000 backlog, losslessness, and a perplexing speed issue'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-3590585173788919523</id><published>2010-12-20T16:20:00.010Z</published><updated>2010-12-22T10:14:59.461Z</updated><title type='text'>Guest post: LoC response to discussion on long-term preservation of JPEG 2000</title><content type='html'>&lt;span style="font-style: italic;"&gt;Carl Fleischhauer, Program Officer at &lt;a href="http://www.digitalpreservation.gov/"&gt;NDIIPP&lt;/a&gt;, Library of Congress, responds to recent posts from &lt;/span&gt;&lt;a style="font-style: italic;" href="http://jpeg2000wellcomelibrary.blogspot.com/2010/12/guest-post-ensuring-suitability-of-jpeg.html"&gt;Johan van der Knijff &lt;/a&gt;&lt;span style="font-style: italic;"&gt;and the &lt;/span&gt;&lt;a style="font-style: italic;" href="http://jpeg2000wellcomelibrary.blogspot.com/2010/12/suitability-of-jpeg2000-for.html"&gt;Wellcome Library&lt;/a&gt;&lt;span style="font-style: italic;"&gt; regarding long-term preservation of JPEG 2000. Both posts mentioned the need to rate the JPEG 2000 format for long-term sustainability using criteria drawn up by the Library of Congress and the National Archives, UK (we have helpfully created an openly available/editable &lt;a href="http://tinyurl.com/39j267t"&gt;Google doc&lt;/a&gt; to make this a collaborative effort). &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Thanks for provocative blogs&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Thanks to Johan van der Knijff and Dave Thompson for the helpful blog postings here that frame some important questions about the sustainability of the JPEG 2000 format.  &lt;a href="http://www.digitalpreservation.gov/partners/pioneers/detail_arms.html"&gt;Caroline Arms &lt;/a&gt;and I were flattered to see that our &lt;a href="http://www.digitalpreservation.gov/formats/intro/format_eval_rel.shtml#factors"&gt;list of format-assessment factors&lt;/a&gt; was cited, along with the criteria developed at the UK National Archives.  We certainly agree that many of these factors have a theoretical turn and that judgments about sustainability must be leavened by actual experience.&lt;br /&gt;&lt;br /&gt;We also call attention to the importance of what we call Quality and Functionality factors (hereafter Q&amp;amp;F factors).  It is possible that some formats will "score" high enough on these factors as to outweigh perceived shortcomings on the Sustainability Factor front.&lt;br /&gt;&lt;br /&gt;As I drafted this response, I benefited from comments from Caroline and Michael Stelmach, the Library of Congress staffer who chairs the &lt;a href="http://www.digitizationguidelines.gov/stillimages/"&gt;Federal Agencies Still Image Digitization Guidelines Working Group&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Colorspace &lt;/span&gt;(as it relates to the LoC's Q&amp;amp;F factor&lt;span style="font-style: italic;"&gt; &lt;/span&gt;Color Maintenance)&lt;br /&gt;&lt;br /&gt;We agree that the JPEG 2000 specification would be improved by the ability to use and declare a wider array of color spaces and/or ICC profile categories.  We join you in endorsing Rob Buckley's valuable work on a JP2 extension to accomplish that outcome.&lt;br /&gt;&lt;br /&gt;When Michael and I were chatting about this topic, he said that he been doing some informal evaluations of the spectra represented in printed matter at the Library of Congress.  This is an informal investigation (so far) and his comment was off the cuff, but he said he had been surprised to see that the colors he had identified in a wide array of original items could indeed be represented within the sRGB color gamut, one of the enumerated color spaces in part 1 of the JPEG 2000 standard.&lt;br /&gt;&lt;br /&gt;Michael added that he knew that some practitioners favor scRGB - not included in the JPEG 2000 enumerated list - either because of scRGB's increased gamut and/or perhaps because it allows for linear-to-intensity representations of brightness rather than only gamma-corrected representations.  The extended gamut - compared to sRGB - will be especially valuable when reproducing items like works of fine art.  And we agree with Johan van der Knijff's statement that there will be times when we will wish to go beyond input-class ICC profiles and embrace 'working' color spaces.  All the more reason to support Rob Buckley's effort.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Adoption&lt;/span&gt; (the LoC Sustainability criteria includes adoption as a factor)&lt;br /&gt;&lt;br /&gt;This is an area in which we all have mixed feelings: there is adoption of JPEG 2000 in some application areas but we wish there were more.  Caroline pointed to one positive indicator: many practitioners who preserve and present high-pixel-count images like scanned maps, have embraced JPEG 2000 in part because of its support for efficient panning and zooming.  The online presentation of maps at the Library of Congress is one good &lt;a href="http://memory.loc.gov/ammem/gmdhtml/gmdhome.html"&gt;example&lt;/a&gt; (for a given map you see an 'old' JPEG in the browser, generated from JPEG 2000 data under the covers).&lt;br /&gt;&lt;br /&gt;Caroline adds that the geospatial community uses JPEG 2000 as a standard (publicly documented, non-proprietary) alternative to the proprietary MrSID.  Both formats continue to be used.  LizardTech tools now support both equally.  Meanwhile, GeoTIFF is used a lot too.  Caroline notes that LizardTech re-introduced a free stand-alone viewer for JPEG2000/MrSID images last year in response to customer demand.  And a new service for solar physics from NASA, Helioviewer, is based on JPEG2000. NASA includes a &lt;a href="http://helioviewer.nascom.nasa.gov/wiki/Helioviewer.org"&gt;justification &lt;/a&gt;for using the format on their website.&lt;br /&gt;&lt;br /&gt;For my part, I can report encountering some JPEG 2000 uptake in moving image circles, ranging from its use in the digital cinema's 'package' specification (see a slightly out of date &lt;a href="http://www.digitalpreservation.gov/formats/fdd/fdd000200.shtml"&gt;summary&lt;/a&gt;) to its inclusion in Front Porch Digital's &lt;a href="http://www.fpdigital.com/Solutions/Migrate/"&gt;SAMMA device&lt;/a&gt;, used to reformat videotapes in a number of archives, including the Library of Congress.&lt;br /&gt;&lt;br /&gt;Meanwhile, Michael recalled seeing papers that explored the use of JPEG 2000 compression in medical imaging (where JPEG 2000 is an option in the DICOM standard), with findings that indicated that diagnoses were just as successful in JPEG 2000 compressed images as they were when radiologists consulted uncompressed images. An online search using a set of terms like "JPEG2000, medical imaging, radiology" will turn up a number of relevant articles on this topic, including Juan Paz &lt;span style="font-style: italic;"&gt;et al&lt;/span&gt;, 2009,  "&lt;a href="http://dx.doi.org/10.1118/1.3233783"&gt;Impact of JPEG 2000 compression on lesion detection in MR imaging&lt;/a&gt;," in &lt;span style="font-style: italic;"&gt;Medical Physics&lt;/span&gt;, which provides evidence to this effect.&lt;br /&gt;&lt;br /&gt;On the other hand - negative indicators, I guess - we have the example of non-adoption by professional still photographers.  On the creation-and-archiving side, their fondness for retaining sensor data motivates them to retain raw files or to wrap that raw data in DNG.  I was curious about the delivery side, and looked at the useful &lt;a href="http://www.dpbestflow.org/"&gt;dpBestFlow&lt;/a&gt; website and book, finding that the author-photographer Richard Anderson reports that he and his professional brethren deliver the following to their customers: RGB or CMYK files (I assume in TIFF or one of the pre-press PDF wrappers), "camera JPEGs" (old style), "camera TIFFs," or DNGs or raw files.  There is no question that the lack of uptake of JPEG 2000 by professional photographers hampers the broader adoption of JPEG 2000.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Software tools &lt;/span&gt;(their existence is part of the Sustainability Factor of Adoption; their misbehavior is, um, misbehavior)&lt;br /&gt;&lt;br /&gt;It was very instructive to see Johan van der Knijff's &lt;a href="http://www.dpconline.org/component/docman/doc_download/526-jp2knov2010vanderkniff"&gt;report &lt;/a&gt;on his experiments with LuraTech, Kakadu, PhotoShop, and ImageMagick.  If he is correct, these packages do misbehave a bit and we should all encourage the manufacturers to fix what is broken.  There is of course a dynamic between the application developers and adoption by their customers.  If there is not greater uptake in realms like professional photography, will the software developers like Adobe take the time to fix things or even continue to support the JPEG 2000 side of their products?&lt;br /&gt;&lt;br /&gt;Caroline, Michael, and I pondered Johan van der Knijff's suggestion that "the best way to ensure sustainability of JPEG 2000 and the JP2 format would be to invest in a truly open JP2 software library."  We found ourselves of two minds about this.  On the one hand, such a thing would be very helpful but, on the other, building such a package is definitely a non-trivial exercise.  What level of functionality would be desired?  The more we want, the more difficult to build.  Johan van der Knijff's comments about JasPer remind us that some open source packages never receive enough labor to produce a product that rivals commercial software in terms of reliability, robustness, and functional richness.  Would we be happy with a play-only application, to let us read the files we created years earlier with commercial packages that, by that future time, are defunct?  In effect such an application would be the front end of a format-migration tool, restoring the raster data so that it can be re-encoded into our new preferred format.  As we thought about this, we wondered if people would come forward to continue to update the software for new programming languages and operating systems, to keep them in operation to ensure that they are still working.&lt;br /&gt;&lt;br /&gt;As a sidebar, Johan van der Knijff summarizes David Rosenthal's argument that "preserving the specifications of a file format doesn’t contribute anything to practical digital preservation" and "the availability of working open-source rendering software is much more important."  We would like to assert that you gotta have 'em both: it would be no good to have the software and not the spec to back it up.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Error resilience&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Preamble to this point:  In drafting this, I puzzled over the fit of error resilience to our Sustainability and Quality/Functionality factors.  In our &lt;a href="http://www.digitalpreservation.gov/formats/fdd/fdd000138.shtml"&gt;description &lt;/a&gt;of JPEG 2000 core coding we mention error resilience  in the Q&amp;amp;F slot Beyond Normal.   But this might not be the best place for it.  Caroline points out that error resilience applies beyond images and she notes that it may conflict with transparency (one of our Sustainability Factors).  We find ourselves wishing for a bit of discussion of this sub-topic.  Should error resilience be added as a Sustainability Factor, or expressed within one of the existing factors?  Meanwhile, how important is transparency as a factor?&lt;br /&gt;&lt;br /&gt;Here's the point in the case of JPEG 2000:  Johan van der Knijff's blog does not comment on the error resilience elements in the JPEG 2000 specification.  These are summarized in annex J, section 7, of the specification (pages 167-68 in the 2004 version), where the need for error resilience is associated with the "delivery of image data over different types of communication channels."  We have heard varying opinions about the potential impact of these elements on long term preservation but tend to feel, "it can't be bad."&lt;br /&gt;&lt;br /&gt;Here are a few of the elements, as outlined in annex J.7:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;The  entropy coding of the quantized coefficients is done within code-blocks.   Since the encoding and decoding of the code-blocks are independent,  bit errors in the bit stream of a code-block will be contained within  that code-block.&lt;/li&gt;&lt;li&gt;Termination of the arithmetic coder is allowed  after every coding pass. Also, the contexts may be reset after each  coding pass. This allows the arithmetic coder to continue to decode  coding passes after errors.&lt;/li&gt;&lt;li&gt;The optional arithmetic coding bypass  style puts raw bits into the bit stream without arithmetic coding. This  prevents the types of error propagation to which variable length coding  is susceptible.&lt;/li&gt;&lt;li&gt;Short packets are achieved by moving the packet  headers to the PPM (Packed Packet headers, Main header marker) or PPT  (Packed packet header, Tile-part header marker) segments.  If there are  errors, the packet headers in the PPM or PPT marker segments can still  be associated with the correct packet by using the sequence number in  the SOP (Start of Packet marker).&lt;/li&gt;&lt;li&gt;A segmentation symbol is a  special symbol. The correct decoding of this symbol confirms the  correctness of the decoding of this bit-plane which allows error  detection.&lt;/li&gt;&lt;li&gt;A packet with a resynchronization marker SOP allows  spatial partitioning and resynchronization. This is placed in front of  every packet in a tile with a sequence number stating at zero. It is  incremented with each packet.&lt;/li&gt;&lt;/ul&gt;&lt;span style="font-weight: bold;"&gt;Conclusion&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Thanks to the Wellcome Library for helping all of us focus on this important topic.  We look forward to a continuing conversation.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-3590585173788919523?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/3590585173788919523/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=3590585173788919523&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/3590585173788919523'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/3590585173788919523'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/12/guest-post-loc-response-to-discussion.html' title='Guest post: LoC response to discussion on long-term preservation of JPEG 2000'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-2799499906277245967</id><published>2010-12-08T14:01:00.008Z</published><updated>2010-12-08T17:45:08.169Z</updated><title type='text'>Suitability of JPEG2000 for preservation, help us do some further work</title><content type='html'>Following on from Johan van der Knijff's &lt;a href="http://tinyurl.com/35pnxvs"&gt;guest post&lt;/a&gt; on this blog we were interested in following up issues that Johan raised. If, as Johan suggests, there are some gaps in the tool sets available for working with JPEG2000 in a reliable way and if some of the long term preservation issues are not well understood, perhaps we could begin to explore where the gaps are. Specifically, we were wondering if we could compare the suitability of just one part of JPEG2000 - the JP2 format - for long term preservation against the two sets of criteria that Johan mentioned.&lt;br /&gt;&lt;br /&gt;These criteria were&lt;br /&gt;&lt;br /&gt;1. The &lt;a href="http://tinyurl.com/342q9pe"&gt;&lt;span style="font-weight: bold;"&gt;Library of Congress Sustainability of Digital Formats Planning for Library of Congress Collections&lt;/span&gt;&lt;/a&gt;, and&lt;br /&gt;2. The &lt;a style="font-weight: bold;" href="http://tinyurl.com/3xbb9mr"&gt;National Archives Digital Preservation Guidance Note 1:&lt;/a&gt;&lt;span style="font-weight: bold;"&gt; &lt;/span&gt;Selecting file formats for long-term preservation.&lt;br /&gt;&lt;br /&gt;Our thinking is that we could do a quick, targeted exercise utilising our community expertise to provide an overview that might reveal useful areas for future research. We propose to limit our investigation to just the JP2 format (for now) and the two sets of suitability criteria. We're looking for high level properties of the JP2 format in relation to the TNA and LoC criteria. High level in the sense that we think that it should be possible to set out properties of JP2 as a series of bullet points against each of the TNA and LoC criteria. It's not a perfect approach by any means, but as a starting point it seems to offer interesting possibilities.&lt;br /&gt;&lt;br /&gt;It's not meant to be definitive, but to serve as an information sharing exercise to help non-technical archivists/librarians better understand the suitability of JP2 to long term preservation, and to highlight areas where more work may be required. In this way we hope to point the way for developers and the more technically minded to do further work that makes JPEG2000 a more suitable format for long term preservation by providing better information/documentation to support that.&lt;br /&gt;&lt;br /&gt;So we're asking you to collaborate with us in this piece of work. We've created a framework document and put it onto &lt;a href="http://tinyurl.com/39j267t"&gt;GoogleDocs&lt;/a&gt;, where it can be viewed and edited. This document summarises the TNA and LoC criteria (the full criteria can be seen online, following the links given above) and space to add your response as bullet points in the right hand column.&lt;br /&gt;&lt;br /&gt;Remember that we're thinking about JP2 only and we're looking for a high level overview - so be brief and stick with the bullet points for now. We'll take on the editing and management of the document.&lt;br /&gt;&lt;br /&gt;We will publish the results sometime in early 2011, providing we can get a sufficient and meaningful response. If you have any questions, please ask!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-2799499906277245967?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/2799499906277245967/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=2799499906277245967&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/2799499906277245967'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/2799499906277245967'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/12/suitability-of-jpeg2000-for.html' title='Suitability of JPEG2000 for preservation, help us do some further work'/><author><name>dnt</name><uri>http://www.blogger.com/profile/11218789008554869322</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='21' height='32' src='http://4.bp.blogspot.com/_M2W_1JyTKf8/SQgicvDXBgI/AAAAAAAAAAk/_9wEIMolmas/S220/dnt_digpres_awards_pic_2006.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-3966460780451014018</id><published>2010-12-02T09:47:00.007Z</published><updated>2010-12-02T12:25:18.046Z</updated><title type='text'>Guest post: Ensuring the suitability of JPEG 2000 for preservation</title><content type='html'>&lt;em&gt;Johan van der Knijff, of the KB/National Library of the Netherlands, follows up his presentation at the JPEG 2000 seminar with a guest blog post on long-term preservation of JPEG 2000.&lt;/em&gt;&lt;br /&gt;&lt;br /&gt;In my &lt;a href="http://www.dpconline.org/component/docman/doc_download/526-jp2knov2010vanderkniff"&gt;presentation &lt;/a&gt;during the JPEG 2000 seminar I discussed the suitability of JPEG 2000 (and more specifically its JP2 format) for long-term preservation. I highlighted the erroneous restriction in the JP2 (and JPX) format specification that only allows ICC profiles of the 'input' class to be used. This effectively prohibits the use of all working colour spaces such as Adobe RGB, which are defined using 'display device' profiles. I also showed how different software vendors interpret the format specification in subtly different ways, and how such issues can create problems in the long term, such as the loss of colour space and resolution information after some future migration.&lt;br /&gt;&lt;br /&gt;This leads us to the question; to what extent we can predict a specific file format's suitability for long-term preservation. The answer is not that straightforward. The Library of Congress assesses file formats against 7 &lt;a href="http://www.digitalpreservation.gov/formats/sustain/sustain.shtml"&gt;'sustainability factors'&lt;/a&gt;, whereas the National Archives have formulated a list of &lt;a href="http://www.nationalarchives.gov.uk/documents/selecting-file-formats.pdf"&gt;12 criteria&lt;/a&gt;. It is beyond the scope of this blog post to present a detailed analysis of the extent to which JP2 lives up to either set of criteria. However, it is interesting to have a look at whether these criteria could have been helpful in identifying the issues covered by my presentation.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;Format specifications&lt;br /&gt;&lt;/strong&gt;First, both the LoC's 'sustainability factors' and the TNA criteria acknowledge the importance of having published specifications of a file format. The LoC uses a 'Disclosure' factor, which refers to “the existence of complete documentation, preferably subject to external expert evaluation”. TNA take this one step further by also defining a 'Documentation Quality' criterion, which expresses the degree to which documentation is comprehensive, accurate and comprehensible. This last criterion largely covers the JPEG 2000 ICC issue, although it's questionable how useful this would have been to identify it a priori. A problem with errors and ambiguities in format specifications is that they can be incredibly easy to overlook, and you may only become aware of them after discovering that different software products interpret the specifications in slightly different ways.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;Adoption&lt;/strong&gt;&lt;br /&gt;Formats that are widely used are typically well supported by an array of software tools, and such formats are unlikely to disappear into obsolescence. TNA expresses this through an 'Ubiquity' criterion, which essentially reflects a file format's overall popularity. The definition of the LoC's 'Adoption' factor includes a list of criteria that can be used as “evidence of adoption”. The first set of criteria here includes “bundling of tools with personal computers, native support in Web browsers or market-leading content creation tools, and the existence of many competing products for creation, manipulation, or rendering of digital objects in the format”.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Note that JP2 isn't doing particularly well when measured against any of these criteria. However, the LoC list adds that “a format that has been reviewed by other archival institutions and accepted as a preferred or supported archival format also provides evidence of adoption”. This certainly seems to be the case for JP2. But how relevant is this, really? Going back to the ICC profiles issue: the JP2 file format has been around for about 10 years now, and its acceptance by the archival community has been growing steadily over the last 5 years or so. Yet, this whole issue seems to have gone unnoticed in the archival community for all those years, and I think this is slightly worrying.&lt;br /&gt;&lt;br /&gt;Now let's imagine for a moment that JP2 would have been picked up by the digital photography and graphic design communities. For such uses the ability to do proper colour management is a basic prerequisite, and limiting the support of ICC profiles to the 'input' class would have made the format virtually useless to these user communities. My guess is that in this -entirely fictional- scenario, the format specification would have either improved quickly (based on feedback from the user community), or the respective user communities would have simply stopped using the format altogether. The problem here seems to be that very few people in the archiving community are even aware of such things as colour spaces and colour management, let alone their importance within the context of preservation. With more established formats such as TIFF this may not be as much of a problem, if only because TIFF has been 'road tested' for decades by the photography and graphic design communities. As an archiving community we cannot fall back to any similar 'road testing' in the case of JP2. And this brings me to my next point.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;Importance of hands-on experience&lt;/strong&gt;&lt;br /&gt;Preservation criteria such as those of the LoC or TNA are invaluable for assessing the suitability of a format for preservation, but I believe it is equally important to have actual hands-on experience with the tools that are used for creating, modifying, and reading the format. For instance, the TNA criteria use the number of software tools that support a given format as an indicator for the extent of current software support of that format. But knowing the number of tools says nothing about how good or useful these tools actually are! In the case of JP2, quite a large number of (mostly free or open-source) tools exist that, under the hood, are using the open &lt;a href="http://www.ece.uvic.ca/~mdadams/jasper/"&gt;JasPer &lt;/a&gt;library. JasPer is known to have performance and stability issues that make it unsuitable for most professional applications (for which, I should emphasise, it was never developed in the first place!). These issues affect all software tools that are using JasPer. So, only counting the number of available tools may be simply missing the point without incorporating any additional quality criteria. But how would you define these?&lt;br /&gt;&lt;br /&gt;Part of the answer, I think, is that assessing a format's suitability for long-term preservation is not a purely top-down process. Most of the software-related issues that I showed in my presentation were found by simply experimenting with actual files, encoders and characterisation tools: convert a TIFF to JP2; convert it back to TIFF; use existing metadata-extraction and characterisation tools such as &lt;a href="http://www.sno.phy.queensu.ca/~phil/exiftool/"&gt;ExifTool &lt;/a&gt;and &lt;a href="http://hul.harvard.edu/jhove/"&gt;JHOVE &lt;/a&gt;to analyse the in- and output files; try to understand the output of these tools; compare the output before and after the conversion, and so on. Such experiments are extremely useful for getting a feel for the strengths and weaknesses of specific software tools, and they can reveal problems that are not readily captured by pre-defined criteria. In some cases, their results may be used to refine existing criteria, or even add new ones.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;Final notes on preservation criteria&lt;br /&gt;&lt;/strong&gt;Although I wouldn’t downplay the importance of preservation criteria such as those used by the LoC or TNA, I think it’s important to realise that such criteria are largely based on theoretical considerations. In most cases they are not based on any empirical data, and as a result their predictive value is largely unknown. For example, an interesting &lt;a href="http://blog.dshr.org/2009/01/are-format-specifications-important-for.html"&gt;blog post &lt;/a&gt;by David Rosenthal argues that preserving the specifications of a file format doesn’t contribute anything to practical digital preservation. According to Rosenthal, the availability of working open-source rendering software is much more important, and he explains how “formats with open source renderers are, for all practical purposes, immune from format obsolescence”.&lt;br /&gt;&lt;br /&gt;This takes us directly to the lack of JPEG 2000-related activity in the open source community, which I also referred to in my presentation. Perhaps the best way to ensure sustainability of JPEG 2000 and the JP2 format would be to invest in a truly open JP2 software library, and release this under a free software license. This could either take the form of the development of a completely new library, or investing in the improvement and further development of an existing one, such as &lt;a href="http://www.openjpeg.org/"&gt;OpenJPEG&lt;/a&gt;. This would require an investment from the archival community, but the payoff may be well worth it.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;Acknowledgement&lt;/strong&gt;: this blog entry was largely inspired by an e-mail discussion that was started by Richard Clark, and in particular by a contribution to this discussion by William Kilbride.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-3966460780451014018?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/3966460780451014018/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=3966460780451014018&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/3966460780451014018'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/3966460780451014018'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/12/guest-post-ensuring-suitability-of-jpeg.html' title='Guest post: Ensuring the suitability of JPEG 2000 for preservation'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-1336040179212875596</id><published>2010-11-29T14:52:00.009Z</published><updated>2010-11-29T15:01:25.552Z</updated><title type='text'>Wellcome Library releases an ITT for a Workflow Tracking System</title><content type='html'>If you’ve been reading our blog regularly you’ll know about how the &lt;a href="http://www.wellcome.ac.uk/News/2010/News/WTX062533.htm"&gt;Library&lt;/a&gt; plans to transform itself into a groundbreaking digital resource, allowing access to much of the Library’s material in digital form.&lt;br /&gt;&lt;br /&gt;As part of this program we’ve just released an &lt;a href="http://library.wellcome.ac.uk/doc_wtx052496.html"&gt;ITT for a Workflow Tracking System&lt;/a&gt;. We’re looking for a system that will track and manage the processes around creating digital content – whether that content is digitised by us, digitised externally or born digital archival material- and automating that activity as much as possible.&lt;br /&gt;&lt;br /&gt;Within the Library, staff who want to add content to our Digital Library will do so using the Workflow Tracking System. This means using the WTS to record that all digital content, e.g. digitised books or archival collections, has been created correctly, has had its descriptive metadata attached, is converted to JPEG2000 (or some other appropriate format) and is ingested into our digital object repository. The WTS will also create metadata encoding and transmission standard (METS) files. These will be used by the front end system to deliver digital content to our users.&lt;br /&gt;&lt;br /&gt;Expressed simply, the WTS will play a critical central role in ensuring that all digital content that is destined for our Digital Library is created, quality controlled and ingested accurately and efficiently into the Library’s repository.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-1336040179212875596?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/1336040179212875596/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=1336040179212875596&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/1336040179212875596'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/1336040179212875596'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/11/wellcome-library-releases-itt-for.html' title='Wellcome Library releases an ITT for a Workflow Tracking System'/><author><name>dnt</name><uri>http://www.blogger.com/profile/11218789008554869322</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='21' height='32' src='http://4.bp.blogspot.com/_M2W_1JyTKf8/SQgicvDXBgI/AAAAAAAAAAk/_9wEIMolmas/S220/dnt_digpres_awards_pic_2006.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-5135482456777746905</id><published>2010-11-24T14:15:00.002Z</published><updated>2010-11-25T08:53:18.503Z</updated><title type='text'>JPEG 2000 seminar - edited highlights #2</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_hR6lGOqlUv0/TOzmwfYP7AI/AAAAAAAAAMc/o52bkYFtIq8/s1600/JP2K_Johan.jpg"&gt;&lt;img style="float: left; margin: 0pt 10px 10px 0pt; cursor: pointer; width: 400px; height: 245px;" src="http://3.bp.blogspot.com/_hR6lGOqlUv0/TOzmwfYP7AI/AAAAAAAAAMc/o52bkYFtIq8/s400/JP2K_Johan.jpg" alt="" id="BLOGGER_PHOTO_ID_5543058962117553154" border="0" /&gt;&lt;/a&gt;This blog post continues my summary of the JPEG 2000 for the Practitioner Seminar (the edited highlights of the first five presentations can be seen in a previous &lt;a href="http://jpeg2000wellcomelibrary.blogspot.com/2010/11/jpeg-2000-seminar-edited-highlights-1.html"&gt;blog post&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;Following Svein Arne Brygfjeld's discussion of the National Library of Norway's use of JPEG 2000, we had Saša Mutić, General Director of Geneza, speaking about the "&lt;span style="font-weight: bold;"&gt;Practical Usage of JP2 Files with Presentational Web Interface&lt;/span&gt;."Saša, based in Slovenia, gave an overview and demonstration of the delivery system &lt;a href="http://www.geneza.com/mediainfo.html"&gt;MediaINFO&lt;/a&gt; that uses a JPEG 2000 image server. This system is soon to be used by the National Library of Norway to deliver their digitised images. Some interesting features include the ability to easily share content, and to create "Personal Library" working spaces. There is also a demonstration of the system on &lt;a href="http://www.youtube.com/watch?v=fBIVnlDX1VE"&gt;YouTube&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Johan van der Knijff, from the Koninklijke Bibliotheek (National Library of the Netherlands), &lt;a href="http://www.dpconline.org/component/docman/doc_download/526-jp2knov2010vanderkniff"&gt;spoke &lt;/a&gt;about "&lt;span style="font-weight: bold;"&gt;JPEG 2000 for Long-Term Preservation in Practice: problems, challenges and possible solutions&lt;/span&gt;." He started off with an overview of the KB's investigations and use of JPEG 2000 (which started in 2007) and their current mass digitisation programme which will see the digitisation of around 14m images. Johan highlighted a number of issues with JPEG 2000 that although fairly minor in nature, are issues that should be addressed by either fixing deficiencies in the standard (particularly around colour profile support), or by changing the way software developers implement the standard (making sure that compressed files do indeed meet the standard). He stressed the importance of a strong user community and knowledge sharing as key to solving the remaining issues with the JPEG 2000 format.&lt;br /&gt;&lt;br /&gt;Gary Hodkinson, Managing Director of &lt;a href="http://www.luratech.com/en/home.html"&gt;LuraTech Ltd.&lt;/a&gt;, gave a &lt;a href="http://www.dpconline.org/component/docman/doc_download/523-jp2knov2010hodkinson"&gt;presentation &lt;/a&gt;entitled "&lt;span style="font-weight: bold;"&gt;Delivering High-Resolution JPEG2000 Images and Documents over the Internet&lt;/span&gt;." He provided a quick background of the company itself, which is German based, but has seen the recent establishment of a UK subsidiary. LuraTech's core business is document conversion and compression, and they supply a JP2 image compression tool called LuraWave. He gave an introductory background to image compression and image formats in general, what the key challenges are around compression, and how JPEG 2000 meets those challenges. He also gave further details on LuraWave, and the LuraTech Image Content Server, which works with JP2 to provide delivery of images to end users.&lt;br /&gt;&lt;br /&gt;As the final speaker of the day, Katty van Mele, from &lt;a href="http://www.intopix.com/"&gt;IntoPIX&lt;/a&gt;, gave an informative &lt;a href="http://www.dpconline.org/component/docman/doc_download/527-jp2knov2010vanmele"&gt;talk &lt;/a&gt;on "&lt;span style="font-weight: bold;"&gt;Pros and cons of JPEG 2000 for video archiving&lt;/span&gt;". She covered a wide range of moving image applications for JPEG 2000, in the cinema, broadcasting and cultural heritage world. JPEG 2000 is the only format currently in use for digital cinema, while broadcasters are still working toward agreeing a suitable long-term format (JPEG 2000 being a leading contender). Katty stressed the fact that massive amounts of material in moving image formats already in existence and continually being created makes long-term storage and preservation a very serious problem. JPEG 2000 is increasingly now seen as the solution to the storage problem, and a number of other problems as well, such as royalty payments currently required to use MPEG for example. IntoPIX provides solutions for converting JP2s, including hardware-based compressors that are orders of magnitude quicker than software-based compressors.&lt;br /&gt;&lt;br /&gt;During the course of the day, delegates were asked to write questions down and post them on whiteboards to raise during the final session of the day: questions and answers, moderated by Ben Gilbert, Photographer at the Wellcome Library. Ben posed the questions to the audience, alternating between technical issues (such as "What is the difference between a tile and a precinct"), to more philosophical questions (such as "Is it really feasible to store master archive images as lossy compressed files?"). This stimulated a good amount of discussion, which is impossible to adequately capture in a blog post!&lt;br /&gt;&lt;br /&gt;Many thanks go to William Kilbride of the DPC for putting all the presentations &lt;a href="http://www.dpconline.org/events/details/19-jpeg-2000-for-the-practioner?xref=19"&gt;online&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_hR6lGOqlUv0/TO4jzlcIs1I/AAAAAAAAAM0/FaOGehOAKYQ/s1600/jp2k10-coats.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 138px;" src="http://3.bp.blogspot.com/_hR6lGOqlUv0/TO4jzlcIs1I/AAAAAAAAAM0/FaOGehOAKYQ/s400/jp2k10-coats.jpg" alt="" id="BLOGGER_PHOTO_ID_5543407560470082386" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_hR6lGOqlUv0/TO4jquba7PI/AAAAAAAAAMs/9MhrUJUryFI/s1600/jp2k10-coats.jpg"&gt;&lt;br /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-5135482456777746905?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/5135482456777746905/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=5135482456777746905&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/5135482456777746905'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/5135482456777746905'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/11/jpeg-2000-seminar-edited-highlights-2.html' title='JPEG 2000 seminar - edited highlights #2'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_hR6lGOqlUv0/TOzmwfYP7AI/AAAAAAAAAMc/o52bkYFtIq8/s72-c/JP2K_Johan.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-6024730229971068382</id><published>2010-11-24T09:03:00.002Z</published><updated>2010-12-03T10:54:49.032Z</updated><title type='text'>JPEG 2000 seminar - edited highlights #1</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_hR6lGOqlUv0/TOpGCA-npDI/AAAAAAAAAME/N6agDCQR6AA/s1600/JP2K1_Simon2.JPG"&gt;&lt;img style="MARGIN: 0pt 10px 10px 0pt; WIDTH: 320px; FLOAT: left; HEIGHT: 313px; CURSOR: pointer" id="BLOGGER_PHOTO_ID_5542319291869144114" border="0" alt="" src="http://4.bp.blogspot.com/_hR6lGOqlUv0/TOpGCA-npDI/AAAAAAAAAME/N6agDCQR6AA/s320/JP2K1_Simon2.JPG" /&gt;&lt;/a&gt;&lt;span style="FONT-STYLE: italic"&gt;&lt;span id="SPELLING_ERROR_0" class="blsp-spelling-error"&gt;JPEG&lt;/span&gt; 2000 for the Practitioner&lt;/span&gt; seminar attracted a full house of 80+ delegates on 16 November at the &lt;span id="SPELLING_ERROR_1" class="blsp-spelling-error"&gt;Wellcome&lt;/span&gt; Trust.&lt;br /&gt;&lt;br /&gt;The aim of the seminar was to look at specific case studies of &lt;span id="SPELLING_ERROR_2" class="blsp-spelling-error"&gt;JPEG&lt;/span&gt; 2000 use, to explain technical issues that have an impact on practical implementation of the format, and explore the context of how and why organisations might choose to use &lt;span id="SPELLING_ERROR_3" class="blsp-spelling-error"&gt;JPEG&lt;/span&gt; 2000. Follow the day as it unwound at Twitter &lt;a href="http://twitter.com/#search?q=%23jp2k10"&gt;#&lt;span id="SPELLING_ERROR_4" class="blsp-spelling-error"&gt;jp&lt;/span&gt;2k10&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Delegates were welcomed by &lt;a href="http://wellcomelibrary.blogspot.com/2010/02/new-head-of-wellcome-library.html"&gt;Simon Chaplin&lt;/a&gt;, Head of the &lt;a href="http://library.wellcome.ac.uk/"&gt;&lt;span id="SPELLING_ERROR_6" class="blsp-spelling-error"&gt;Wellcome&lt;/span&gt; Library&lt;/a&gt;, who briefly summarized the context of the &lt;span id="SPELLING_ERROR_7" class="blsp-spelling-error"&gt;Wellcome's&lt;/span&gt; digital library &lt;a href="http://library.wellcome.ac.uk/node350.html"&gt;ambitions&lt;/a&gt;. I (&lt;span id="SPELLING_ERROR_8" class="blsp-spelling-error"&gt;Christy&lt;/span&gt; &lt;span id="SPELLING_ERROR_9" class="blsp-spelling-error"&gt;Henshaw&lt;/span&gt;) gave a quick introduction to the &lt;a href="http://jp2k-uk.wikidot.com/"&gt;JP2K-UK&lt;/a&gt; group, and the origins of the seminar as one of the main outcomes from the group discussions. What follows is an edited highlights version of the talks given on the day; the full presentations are available on the &lt;span id="SPELLING_ERROR_10" class="blsp-spelling-error"&gt;DPC&lt;/span&gt; &lt;a href="http://www.dpconline.org/events/details/19-jpeg-2000-for-the-practioner?xref=19"&gt;website&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;The first &lt;a href="http://www.dpconline.org/component/docman/doc_download/525-jp2knov2010tanner"&gt;talk&lt;/a&gt;, "&lt;span style="FONT-WEIGHT: bold"&gt;What did &lt;span id="SPELLING_ERROR_11" class="blsp-spelling-error"&gt;JPEG&lt;/span&gt; 2000 ever do for us?&lt;/span&gt;" was given by &lt;a href="http://www.kdcs.kcl.ac.uk/who/bios/simon-tanner.html"&gt;Simon Tanner&lt;/a&gt;, Director of &lt;a href="http://www.kdcs.kcl.ac.uk/index.html"&gt;King's Digital Consultancy Service&lt;/a&gt;. The fact of the matter, according to Simon, is that although &lt;span id="SPELLING_ERROR_12" class="blsp-spelling-error"&gt;JPEG&lt;/span&gt; 2000 is "cool and &lt;span id="SPELLING_ERROR_13" class="blsp-spelling-error"&gt;froody&lt;/span&gt;", and has a lot to offer in terms of functionality and &lt;span id="SPELLING_ERROR_14" class="blsp-spelling-corrected"&gt;intelligent&lt;/span&gt; format design, those who use it are doing so because it can save them money. The economic benefits can not be underestimated for large scale digitisation - even though storage is relatively cheap these days, the total cost of owning a million images is quite high. Storing master files as &lt;span id="SPELLING_ERROR_15" class="blsp-spelling-error"&gt;JPEG&lt;/span&gt; 2000s can save an institution over £100,000 per year in terms of ongoing storage costs.&lt;br /&gt;&lt;br /&gt;Richard Clark, Managing Director of &lt;a href="http://www.elysium.ltd.uk/index.xalter"&gt;Elysium &lt;/a&gt;Ltd., gave an &lt;a href="http://www.dpconline.org/component/docman/doc_download/522-jp2knov2010clark"&gt;overview &lt;/a&gt;of the &lt;span id="SPELLING_ERROR_16" class="blsp-spelling-error"&gt;JPEG&lt;/span&gt; 2000 standard&lt;span style="FONT-WEIGHT: bold"&gt;, "&lt;span id="SPELLING_ERROR_17" class="blsp-spelling-error"&gt;JPEG&lt;/span&gt; 2000 Standardisation: A Practical Viewpoint&lt;/span&gt;." As the UK head of delegation to the &lt;a href="http://www.jpeg.org/committee.html"&gt;&lt;span id="SPELLING_ERROR_18" class="blsp-spelling-error"&gt;JPEG&lt;/span&gt; Committee&lt;/a&gt;, Richard has been involved with developing the standard since its inception. Richard ran through the key features and functionality that can be achieved with the &lt;span id="SPELLING_ERROR_19" class="blsp-spelling-error"&gt;JPEG&lt;/span&gt; 2000 format (and its many parts), and explained the rationale behind the standard. He quoted the original objective, which was to develop an "architecturally based standard" that would enable flexibility for a wide range of uses, and he demonstrated that this was, in fact achieved. Although &lt;span id="SPELLING_ERROR_20" class="blsp-spelling-error"&gt;JPEG&lt;/span&gt; 2000 has a lot to offer the cultural heritage industry, that industry has not been well represented on the standards committees.&lt;br /&gt;&lt;br /&gt;The next hour was taken up with the "&lt;span style="FONT-WEIGHT: bold"&gt;Profiles&lt;/span&gt;" session. Sean Martin, Head of Architecture and Development at the &lt;a href="http://www.bl.uk/"&gt;British Library&lt;/a&gt;, &lt;a href="http://www.dpconline.org/component/docman/doc_download/524-jp2knov2010martin"&gt;kicked off&lt;/a&gt; with a description of the JP2 profile (i.e. the specific parameter settings) to be used for the British Library's newspapers project. Key to point out here is that the British Library has opted for &lt;span id="SPELLING_ERROR_21" class="blsp-spelling-error"&gt;lossy&lt;/span&gt; compression for its archival masters, stating that "it is also desirable that the same master file support the needs for both long term archival and also access." &lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_hR6lGOqlUv0/TOpGrVnKEkI/AAAAAAAAAMU/zG4hK87YfR8/s1600/JP2K1_Christy.JPG"&gt;&lt;img style="MARGIN: 0pt 10px 10px 0pt; WIDTH: 200px; FLOAT: left; HEIGHT: 186px; CURSOR: pointer" id="BLOGGER_PHOTO_ID_5542320001782518338" border="0" alt="" src="http://4.bp.blogspot.com/_hR6lGOqlUv0/TOpGrVnKEkI/AAAAAAAAAMU/zG4hK87YfR8/s200/JP2K1_Christy.JPG" /&gt;&lt;/a&gt;I followed with a brief summary of the compression aspects of the &lt;span id="SPELLING_ERROR_22" class="blsp-spelling-error"&gt;Wellcome&lt;/span&gt; Library's profile (our JP2 profile is available &lt;a href="http://library.wellcome.ac.uk/assets/wtx056572.pdf"&gt;online&lt;/a&gt;), and how we determine the &lt;a href="http://jpeg2000wellcomelibrary.blogspot.com/2010/08/as-result-of-our-decision-to-go-lossy.html"&gt;right level &lt;/a&gt;of compression. Like the British Library, we use &lt;span id="SPELLING_ERROR_23" class="blsp-spelling-error"&gt;lossy&lt;/span&gt; compression for our archival masters, and will use the same file for providing access. &lt;span id="SPELLING_ERROR_24" class="blsp-spelling-error"&gt;Bedrich&lt;/span&gt; &lt;span id="SPELLING_ERROR_25" class="blsp-spelling-error"&gt;Vychodil&lt;/span&gt; &lt;a href="http://www.dpconline.org/component/docman/doc_download/520-jp2knov2010bedrich"&gt;presented &lt;/a&gt;the new JP2 profiles for the &lt;a href="http://www.nkp.cz/_en/index.php3"&gt;National Library of the Czech Republic&lt;/a&gt; that will soon come into force for a wide range of materials. In contrast to the British Library and the &lt;span id="SPELLING_ERROR_26" class="blsp-spelling-error"&gt;Wellcome&lt;/span&gt;, the Czech National Library will use a different, &lt;span id="SPELLING_ERROR_27" class="blsp-spelling-error"&gt;lossless&lt;/span&gt;, profile for their archival masters, and a &lt;span id="SPELLING_ERROR_28" class="blsp-spelling-error"&gt;lossy&lt;/span&gt; profile for their access files. Delegates were provided with a list of these parameter settings, as well as several others, available &lt;a href="http://www.dpconline.org/component/docman/doc_download/529-jp2knov2010parametercomparisonchart"&gt;online&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Petr &lt;span id="SPELLING_ERROR_29" class="blsp-spelling-error"&gt;Zabicka&lt;/span&gt; &lt;a href="http://www.dpconline.org/component/docman/doc_download/528-jp2knov2010zabicka"&gt;spoke &lt;/a&gt;about "&lt;span style="FONT-WEIGHT: bold"&gt;&lt;span id="SPELLING_ERROR_30" class="blsp-spelling-error"&gt;IIPImage&lt;/span&gt; and &lt;span id="SPELLING_ERROR_31" class="blsp-spelling-error"&gt;OldMapsOnline&lt;/span&gt;&lt;/span&gt;", a &lt;a href="http://help.oldmapsonline.org/Home"&gt;development project&lt;/a&gt; carried out by the &lt;a href="http://www.mzk.cz/eng/knihovne/"&gt;Moravian Library&lt;/a&gt; in the Czech Republic that uses &lt;span id="SPELLING_ERROR_32" class="blsp-spelling-error"&gt;JPEG&lt;/span&gt; 2000 to display large images, in particular maps. The imaging server they have devised is based on &lt;span id="SPELLING_ERROR_33" class="blsp-spelling-error"&gt;IIPImage&lt;/span&gt; and uses the tiles encoded into the &lt;span id="SPELLING_ERROR_34" class="blsp-spelling-error"&gt;JPEG&lt;/span&gt; 2000 format to provide speedy access to portions of the image when zooming and panning. More uniquely, they have developed a &lt;span id="SPELLING_ERROR_35" class="blsp-spelling-error"&gt;georeferencing&lt;/span&gt; application that allows the user to match points on historic maps with those on Google maps, and to overlay - and correct - old maps using the Google maps &lt;span id="SPELLING_ERROR_36" class="blsp-spelling-error"&gt;API&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_hR6lGOqlUv0/TOpGN4cg2QI/AAAAAAAAAMM/YBeK-NUbfno/s1600/JP2K1_Svein%2BArne.JPG"&gt;&lt;img style="MARGIN: 0pt 10px 10px 0pt; WIDTH: 320px; FLOAT: left; HEIGHT: 240px; CURSOR: pointer" id="BLOGGER_PHOTO_ID_5542319495737039106" border="0" alt="" src="http://4.bp.blogspot.com/_hR6lGOqlUv0/TOpGN4cg2QI/AAAAAAAAAMM/YBeK-NUbfno/s320/JP2K1_Svein%2BArne.JPG" /&gt;&lt;/a&gt;&lt;br /&gt;After a well-deserved lunch, delegates heard &lt;span id="SPELLING_ERROR_37" class="blsp-spelling-error"&gt;Svein&lt;/span&gt; Arne &lt;span id="SPELLING_ERROR_38" class="blsp-spelling-error"&gt;Brygfjeld&lt;/span&gt; from the &lt;a href="http://www.nb.no/english"&gt;National Library of Norway&lt;/a&gt; &lt;a href="http://www.dpconline.org/component/docman/doc_download/521-jp2knov2010brygfjeld"&gt;speak &lt;/a&gt;about "&lt;span style="FONT-WEIGHT: bold"&gt;Implementing JP2K for &lt;span id="SPELLING_ERROR_39" class="blsp-spelling-error"&gt;Preserv&lt;/span&gt;...&lt;/span&gt;" (his title was abbreviated in order to fit a picture of a glacier on the slide, but I am led to believe the title ended with "&lt;span style="FONT-WEIGHT: bold"&gt;..&lt;span id="SPELLING_ERROR_40" class="blsp-spelling-error"&gt;ation&lt;/span&gt; and access, experiences from the National Library of Norway&lt;/span&gt;". The glacier provides a key to the talking point of &lt;span id="SPELLING_ERROR_41" class="blsp-spelling-error"&gt;Svein&lt;/span&gt; Arne's presentation - extremes. Located in the Arctic Circle, at Mo i Rana, the &lt;span id="SPELLING_ERROR_42" class="blsp-spelling-error"&gt;NLN&lt;/span&gt; is carrying out mass digitisation of newspapers and other materials, and has recently decided to store their master files as &lt;span id="SPELLING_ERROR_43" class="blsp-spelling-error"&gt;JPEG&lt;/span&gt; 2000 &lt;span id="SPELLING_ERROR_44" class="blsp-spelling-error"&gt;lossless&lt;/span&gt; files. Digitisation is such a large part of what the &lt;span id="SPELLING_ERROR_45" class="blsp-spelling-error"&gt;NLN&lt;/span&gt; does, that around 30% of the workforce is involved in digitisation.&lt;br /&gt;&lt;br /&gt;Stay tuned for &lt;a href="http://jpeg2000wellcomelibrary.blogspot.com/2010/11/jpeg-2000-seminar-edited-highlights-2.html"&gt;more edited highlights&lt;/a&gt; covering the second half of the seminar...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-6024730229971068382?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/6024730229971068382/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=6024730229971068382&amp;isPopup=true' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/6024730229971068382'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/6024730229971068382'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/11/jpeg-2000-seminar-edited-highlights-1.html' title='JPEG 2000 seminar - edited highlights #1'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_hR6lGOqlUv0/TOpGCA-npDI/AAAAAAAAAME/N6agDCQR6AA/s72-c/JP2K1_Simon2.JPG' height='72' width='72'/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-3083132455003621741</id><published>2010-10-18T13:28:00.005+01:00</published><updated>2010-11-02T09:15:09.326Z</updated><title type='text'>JPEG 2000 seminar - draft programme now available</title><content type='html'>Places are still available on the JPEG 2000 &lt;a href="http://www.dpconline.org/events/details/19-jpeg-2000-for-the-practioner?xref=19"&gt;seminar &lt;/a&gt;to be held at the Wellcome Trust on 16 November.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Draft programme with timetable and confirmed speakers:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;09:00 Registration, coffee&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;10:00&lt;/span&gt; &lt;span style="font-weight: bold;"&gt;Welcome, introduction&lt;/span&gt;&lt;br /&gt;Christy Henshaw, Wellcome Library, Chair of JP2K-UK&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Morning session&lt;/span&gt; Chair: William Kilbride, Executive Director, Digital Preservation Coalition&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;10:10 What did JPEG 2000 ever do for us?&lt;/span&gt;&lt;br /&gt;Simon Tanner, Director, Kings Digital Consultancy Service&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;10:40&lt;/span&gt; &lt;span style="font-weight: bold;"&gt;JPEG 2000 standardization - a pragmatic viewpoint&lt;/span&gt;&lt;br /&gt;Richard Clark, UK head of delegation to JPEG and MD of Elysium Ltd.&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;11:10 JPEG 2000 profiles&lt;/span&gt;&lt;br /&gt;Five ten-minute presentations moderated by Sean Martin, Head of Architecture and Development, British Library&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;12:10 IIPImage and OldMapsOnline&lt;/span&gt;&lt;br /&gt;Petr Zabicka, Head of R&amp;amp;D, Moravian Library, Czech Republic&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;12:40 LUNCH&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Early afternoon session&lt;/span&gt; Chair: Dave Thompson, Digital Curator, Wellcome Library&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;13:40&lt;/span&gt; &lt;span style="font-weight: bold;"&gt;JP2K for preservation and access, experiences from the National Library of  Norway&lt;/span&gt;&lt;br /&gt;Svein Arne Brygfjeld, National Library of Norway&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;14:10 Web presentation of JPEG 2000 images&lt;/span&gt;&lt;br /&gt;Sasa Mutic, Geneza and Ivo Iossiger, 4DigitalBooks, Switzerland&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;14:40 JPEG 2000 for long-term preservation in practice: problems, challenges and possible solutions&lt;/span&gt;&lt;br /&gt;Johan van der Knijff, Koninklijke Bibliotheek (NL)&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;15:10 Coffee&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Late afternoon session&lt;/span&gt; Chair: Simon Tanner, Director, Kings Digital Consultancy Service&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;15:40&lt;/span&gt; &lt;span style="font-weight: bold;"&gt;Delivering High-Resolution JPEG2000 Images and Documents over the Internet&lt;/span&gt;&lt;br /&gt;Gary Hodkinson, MD of Luratech Ltd.&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;16:10 Pros and Cons of JPEG 2000 for video archiving&lt;/span&gt;&lt;br /&gt;Katty van Mele, IntoPIX&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;16:40&lt;/span&gt; &lt;span style="font-weight: bold;"&gt;Questions and discussion&lt;/span&gt;&lt;br /&gt;Moderated by Ben Gilbert, Photographer, Wellcome Library&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;17:10 Concluding remarks&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-3083132455003621741?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/3083132455003621741/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=3083132455003621741&amp;isPopup=true' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/3083132455003621741'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/3083132455003621741'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/10/jpeg-2000-seminar-draft-programme-now.html' title='JPEG 2000 seminar - draft programme now available'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-1374986912097040490</id><published>2010-10-01T09:02:00.011+01:00</published><updated>2010-10-01T14:27:57.768+01:00</updated><title type='text'>Guest post: Examining losses, a simple Photoshop technique for evaluating lossy-compressed images</title><content type='html'>&lt;em&gt;Bill Comstock, Head of Imaging Services at Harvard College Library, writes a second post about using Photoshop to evaluate lossy compressed images.&lt;/em&gt;&lt;br /&gt;&lt;br /&gt;If you decide to employ JPEG2000’s lossy compression scheme, you will also have to determine the degree to which you are willing to compress your files; you’ll have to work to identify that magic spot where you realize a perfect balance between file size reduction and the preservation of image quality.&lt;br /&gt;&lt;br /&gt;Of course, there is no magic spot, no perfect answer -- not for any single image, and certainly not for the large batches of images that you will want to process using a single compression recipe. Regardless of whether you decide to control the application of compression by setting the compression ratio, using a software-specific “quality” scale, or by signal-to-noise ratio, you will want to test a variety of settings on a range of images, scrutinize the results, and then decide where to set your software.&lt;br /&gt;&lt;br /&gt;Below I describe a Photoshop technique for overlaying an original, uncompressed source image, with a compressed version of the image to measure the difference between the two, and to draw your attention to regions where the compressed version of the image differs most significantly from the source image. Credit for the technique belongs to &lt;a href="http://www.pixelgenius.com/bios/bruce_bio.html"&gt;Bruce Fraser&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;1. First, open up the two images that you want to compare (the original source image, and the compressed JP2 derivative) in Photoshop.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://3.bp.blogspot.com/_hR6lGOqlUv0/TKXg_t-RiKI/AAAAAAAAALM/JroV-tAs9rI/s1600/image_apply_01.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 320px; height: 294px;" src="http://3.bp.blogspot.com/_hR6lGOqlUv0/TKXg_t-RiKI/AAAAAAAAALM/JroV-tAs9rI/s320/image_apply_01.jpg" alt="" id="BLOGGER_PHOTO_ID_5523067903316953250" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;2. Next, go to the “Image” menu and select “Apply Image”.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_hR6lGOqlUv0/TKXhRZb9khI/AAAAAAAAALU/E88WQ59c-Tc/s1600/image_apply_02.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 256px; height: 320px;" src="http://2.bp.blogspot.com/_hR6lGOqlUv0/TKXhRZb9khI/AAAAAAAAALU/E88WQ59c-Tc/s320/image_apply_02.jpg" alt="" id="BLOGGER_PHOTO_ID_5523068207041974802" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;3. Set Blending to “Subtract”; Scale to “1”; and Offset to “128.“&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_hR6lGOqlUv0/TKXha4S1_vI/AAAAAAAAALc/myxXUGNbuSQ/s1600/image_apply_03.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 320px; height: 261px;" src="http://4.bp.blogspot.com/_hR6lGOqlUv0/TKXha4S1_vI/AAAAAAAAALc/myxXUGNbuSQ/s320/image_apply_03.jpg" alt="" id="BLOGGER_PHOTO_ID_5523068369944051442" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;4. The differences between the two images are now visible (you may need to magnify the image beyond 100%), and the standard deviation between the two copies can be displayed on the Histogram panel.&lt;br /&gt;&lt;br /&gt;(A standard deviation of zero indicates that the two copies are identical and that the compressed version was losslessly compressed.)&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_hR6lGOqlUv0/TKXhwmFO8AI/AAAAAAAAALk/oEM8vBLtc-0/s1600/image_apply_04.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 320px; height: 213px;" src="http://4.bp.blogspot.com/_hR6lGOqlUv0/TKXhwmFO8AI/AAAAAAAAALk/oEM8vBLtc-0/s320/image_apply_04.jpg" alt="" id="BLOGGER_PHOTO_ID_5523068743012249602" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Another option: &lt;/span&gt;You can also create a two layer image in PS where one layer is the source image, the second layer is the compressed copy, and by setting the blending option to “difference”. You may find the technique described in detail above preferable, if only because it makes the variance between the two copies more easily visible by shifting the pixel-to-pixel differences into the middle gray region.&lt;br /&gt;&lt;br /&gt;Within the group that I manage, we modulate compression using PSNR. We test each candidate setting on a large number of images and then examine some number of the least and most compressed images in the set. We repeat the process until we have zeroed in on what seems to be the best setting.&lt;br /&gt;&lt;br /&gt;Good luck!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-1374986912097040490?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/1374986912097040490/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=1374986912097040490&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/1374986912097040490'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/1374986912097040490'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/10/guest-post-examining-losses-simple.html' title='Guest post: Examining losses, a simple Photoshop technique for evaluating lossy-compressed images'/><author><name>dnt</name><uri>http://www.blogger.com/profile/11218789008554869322</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='21' height='32' src='http://4.bp.blogspot.com/_M2W_1JyTKf8/SQgicvDXBgI/AAAAAAAAAAk/_9wEIMolmas/S220/dnt_digpres_awards_pic_2006.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_hR6lGOqlUv0/TKXg_t-RiKI/AAAAAAAAALM/JroV-tAs9rI/s72-c/image_apply_01.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-1113837777380583210</id><published>2010-09-20T14:52:00.007+01:00</published><updated>2010-09-21T08:49:25.930+01:00</updated><title type='text'>Calling all JPEG 2000 profiles</title><content type='html'>We plan to provide delegates of our JPEG 2000 &lt;a href="http://jpeg2000wellcomelibrary.blogspot.com/2010/08/jpeg-2000-for-practitioner-free-one-day.html"&gt;Seminar (&lt;/a&gt;16 Nov) with a list of JPEG 2000 profiles from a number of organisations who are currently using the format. Some of these will be briefly presented during a "Profiles" session on the day.&lt;br /&gt;&lt;br /&gt;Do you have a profile you are currently using, and would like to distribute to your peers? If so, please send the following details to Christy at c.henshaw@wellcome.ac.uk.&lt;br /&gt;&lt;br /&gt;The specific information we're looking for includes:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Used for:&lt;/span&gt; (e.g Newspapers)&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Conversion software used: &lt;/span&gt;(e.g. Kakadu)&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;File format: &lt;/span&gt;(e.g. Part 1 (.jp2)&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Lossy or lossless:&lt;/span&gt; (choose)&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Typical compression: &lt;/span&gt;(expressed as a ratio)&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Tiling: &lt;/span&gt;(e.g. 1024 x 1024)&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Progression order:&lt;/span&gt; (e.g. RPCL)&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;No. of decomposition levels: &lt;/span&gt;(e.g. 6)&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Number of quality layers:&lt;/span&gt; (e.g. 12)&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Code block size (xcb = yxb)&lt;/span&gt;: (e.g. 6)&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Transformation&lt;/span&gt;: (e.g. 9-7 irreversible filter)&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Precinct size&lt;/span&gt;: (e.g. 128 x 128)&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Regions of interest&lt;/span&gt;: (yes or no)&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Code block size&lt;/span&gt; : (e.g. 64 x 64)&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;TLM markers&lt;/span&gt;: (yes or no)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-1113837777380583210?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/1113837777380583210/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=1113837777380583210&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/1113837777380583210'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/1113837777380583210'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/09/calling-all-jpeg-2000-profiles.html' title='Calling all JPEG 2000 profiles'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-4587548830142712769</id><published>2010-09-14T08:47:00.004+01:00</published><updated>2010-09-14T08:55:08.613+01:00</updated><title type='text'>New Wellcome Digital Library blog</title><content type='html'>The Wellcome Library has launched a new blog (&lt;a href="http://wellcomedigitallibrary.blogspot.com/"&gt;wellcomedigitallibrary.blogspot.com/&lt;/a&gt;), centered on the development of the Wellcome Digital Library. The blog will be a "a real-time progress report, discussion outlet, and notification area."&lt;br /&gt;&lt;br /&gt;This JPEG 2000 blog will still remain focused specifically on the work being done around JPEG 2000 at the Library, but the Wellcome Digital Library blog will provide much broader information on the programme, including:&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;What will be digitised, and how the content will be of use to researchers.&lt;/li&gt;&lt;li&gt;How we will facilitate research activity, learning, and discovery.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Logistics of digitisation and workflows.&lt;/li&gt;&lt;li&gt;In-house vs. outsource options.&lt;/li&gt;&lt;li&gt;Metadata.&lt;/li&gt;&lt;li&gt;Long-term data management.&lt;/li&gt;&lt;li&gt;Delivery formats, speeds, and functions.&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-4587548830142712769?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/4587548830142712769/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=4587548830142712769&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/4587548830142712769'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/4587548830142712769'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/09/new-wellcome-digital-library-blog.html' title='New Wellcome Digital Library blog'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-2283465806206824499</id><published>2010-09-10T08:56:00.004+01:00</published><updated>2010-09-10T09:48:57.377+01:00</updated><title type='text'>Guest post: JPEG2000 recipes for the Aware encoder</title><content type='html'>&lt;span style="font-style: italic;font-size:100%;" &gt;As our first guest poster, Bill Comstock, Head of &lt;a href="http://imaging.harvard.edu/"&gt;Imaging Services&lt;/a&gt; at Harvard College Library, writes about the specific "recipes" used at Harvard for producing JPEG2000 images.&lt;/span&gt;&lt;span style="font-size:100%;"&gt;&lt;br /&gt;&lt;br /&gt;I needed help remembering when it was that we began making JPEG2000 images. I ran a search against the Harvard Library’s preservation digital repository, DRS, and it looks like we first deposited a JP2 image in 2004.&lt;br /&gt;&lt;br /&gt;Over the intervening six years, we’ve refined and settled on a single recipe that we use to produce lossy-compressed images, and another that we use to produce losslessly-compressed JP2s. I’ll share these recipes with you below. My reasons for sharing are two:&lt;br /&gt;&lt;br /&gt;1) Depending upon the software used, there are many encoding combinations to consider - many more than one would have to consider when cooking up the more familiar and less complex TIFF. My group uses the &lt;a href="http://www.aware.com/imaging/jpeg2000.htm"&gt;Aware JPEG2000 SDK&lt;/a&gt; encoder. Flipping through Aware’s 330 page manual (“AccuRad J2KSuite Developer’s Guide”), I count...152 different command line options. In sharing our recipes (the combination of options and parameters that we invoke ), I’d like to speed others along in developing their own encoding formulations and JP2 production workflows.&lt;br /&gt;&lt;br /&gt;2) Not only are there many encoding options to consider, but some are complex and a bit intimidating. Honestly, I don’t know what the “--set-input-raw-channel-subsampling” or “--set-output-j2k-rd-slope” options do. I do think that I understand the options that we use, but I may hear from one of you that my understanding is flawed and that our recipe could be improved upon, or at least better understood and explained. Here you go.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Lossless encoding (Windows command line)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;j2kdriver.exe --set-input-image &lt;/span&gt;&lt;input filename=""  style="font-family:courier new;"&gt;&lt;span style="font-family:courier new;"&gt; --set-output-j2k-color-xform YES --set-output-j2k-error-resilience ALL --wavelet-transform R53 --set-output-j2k-bitrate 0 --set-output-j2k-progression-order RLCP --tile-size 1024 1024 --output-file-type JP2 --output-file-name &lt;/span&gt;&lt;output filename=""&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Notes on individual options:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;/output&gt;&lt;/span&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;&lt;output filename=""&gt;   &lt;span style="font-family:courier new;"&gt;“--set-input-image&lt;/span&gt; &lt;input filename=""&gt;” reads file into the encoder's memory-buffer and auto-detects the input file format&lt;/output&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;&lt;output style="font-family: courier new;" filename=""&gt;“-- set output-j2k-color-xform YES”&lt;/output&gt;     I believe that one does not need to call this option explicitly; YES seems to be the default value. The transform referred to here is a colorspace transformation from RGB to YUV. This conversion is made prior to compressing the data. Applying compression to YUV data is more efficient, yielding smaller files than the same compression applied to the unconverted RGB data.&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul  style="font-family:courier new;"&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;&lt;output filename=""&gt;“&lt;span style="font-family:courier new;"&gt;--set-output-j2k-error-resilience ALL&lt;/span&gt;”&lt;/output&gt;&lt;span style="font-family:georgia;"&gt; This function will take the following parameters, each explained below by Aware’s Alexis Tzannes.&lt;/span&gt;&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;&lt;output filename=""&gt; SOP to enable Start of Packet markers&lt;/output&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;&lt;output filename=""&gt;EPH to enable End of Packet Header markers&lt;/output&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;&lt;output filename=""&gt;   SEG to enable segmentation symbols&lt;/output&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;&lt;output filename=""&gt;ALL to enable all of the above&lt;/output&gt;&lt;output filename=""&gt;&lt;br /&gt;&lt;/output&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;&lt;output filename=""&gt;NONE to disable all of the options&lt;/output&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;&lt;output filename=""&gt;&lt;span style="font-weight: bold;"&gt;Resynchronization markers: &lt;/span&gt;Start of Packet (SOP), End of Packet Headers (EPH).  These are used to signal the beginning of each packet and the end of each packet header, and can be used to resynchronize in the case of missing or corrupted data.  This allows the decoder to detect and discard entire corrupted packets.&lt;/output&gt; So the SOP and EPH are basically tags that signal the beginning and end of a packet (a piece of coded data) in the file. If a packet gets corrupted the error resilient decoder can resync using the next SOP packet marker. The idea here is that if one packet in a file is bad, we don't lose everything that comes after it (as was the case with original JPEG). With JPEG 2000, you could lose a packet and have a small area of the image go bad, but the decoder can recover and keep on decoding.&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;&lt;output filename=""&gt;&lt;span style="font-weight: bold;"&gt;Segmentation symbols:&lt;/span&gt; this adds a special four symbol code to specific locations in the compressed data stream, enabling error resilient decoders to detect errors, if this symbol is corrupted.  This allows the decoder to detect and discard corrupted bitplanes.&lt;/output&gt; So this is similar, but less granular, as it operates at the bitplane level, each bitplane may include multiple packets. Overall, these features would be useful in noise prone environments or over unreliable networks.&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;&lt;output filename=""&gt;“&lt;span style="font-family:courier new;"&gt;--wavelet-transform R53&lt;/span&gt;” specifies use of the reversible "integer 5-3 filter" (compression) to produce a losslessly encoded JPEG2000 file.&lt;/output&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;span style="font-size:100%;"&gt;&lt;output filename=""&gt;   &lt;/output&gt;&lt;/span&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;&lt;output filename=""&gt;“&lt;span style="font-family:courier new;"&gt;--set-output-j2k-bitrate 0&lt;/span&gt;”&lt;/output&gt; Quoting from the “AccuRad J2KSuite Developer’s Guide”: “Sets the output image bitrate, in bits per pixel. A bitrate of 0 indicates that all the quantized data should be included in the image. This creates lossless images if the R53 wavelet is chosen Sets the output image bitrate, in bits per pixel. A bitrate of 0 indicates that all the quantized data should be included in the image. This creates lossless images if the R53 wavelet is chosen [...].”&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;&lt;output style="font-family: courier new;" filename=""&gt;“--progression-order RLCP”&lt;/output&gt; “For a given tile, the packets contain data from a specific layer, a specific component, a specific resolution, and a specific precinct. The order in which these packets are interleaved is called the progression order. The interleaving of the packets can progress along four axes: layer, component, resolution and precinct.”  [1] A progression order that begins with “R” (resolution) indicates that the data is organized so that low resolution information will be decoded first, followed and augmented by the remaining higher resolution data in the codestream.&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;&lt;output filename=""&gt;“&lt;span style="font-family:courier new;"&gt;--tile-size 1024 1024&lt;/span&gt;”&lt;/output&gt; This tile size was prescribed, as it was said to be optimally matched to the software our library uses to dynamically generate and deliver JPEG files from stored JP2 masters.&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;span style="font-size:100%;"&gt;&lt;output filename=""&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Lossy encoding&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;j2kdriver.exe --set-input-image-file &lt;/span&gt;&lt;input filename=""  style="font-family:courier new;"&gt;&lt;span style="font-family:courier new;"&gt; --set-output-j2k-color-xform YES --set-output-j2k-error-resilience ALL --wavelet-transform I97 --set-output-j2k-progression-order RLCP --set-output-j2k-psnr 46 --tile-size 1024 1024 --output-file-type JP2 --output-file-name&lt;/span&gt; &lt;output filename=""&gt;&lt;br /&gt;&lt;br /&gt;&lt;/output&gt;&lt;/output&gt;&lt;/span&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;&lt;output filename=""&gt;&lt;output filename=""&gt;   “&lt;span style="font-family:courier new;"&gt;--wavelet-transform I97&lt;/span&gt;” specifies use of the “irreversible  9-7 filter” to produce a lossy encoded JPEG2000 file.&lt;/output&gt;&lt;/output&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;&lt;output filename=""&gt;&lt;output filename=""&gt; &lt;span style="font-family:courier new;"&gt;“--set-output-j2k-psnr 46”&lt;/span&gt;&lt;/output&gt;&lt;/output&gt; The pSNR function was selected because it effectively, although imperfectly, modulates the level of compression applied to each image based on the image's particular characteristics: the arrangement and variability of raster values. When we set the pSNR value to 46 db for the page-images that we create, we've come to expect a very high-quality encoded image. There are cases (certain kinds of photographs, illustrations with fine thin lines) where a 46 db setting would produce a too heavily compressed file. Too guard against over-compression, we have developed an effective (although a bit crude) method for dynamically resetting the db value if the file appears to have been too heavily compressed. This is a topic for another day.&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;&lt;output filename=""&gt;&lt;output filename=""&gt;      I believe that most users set a fixed compression ratio, e.g., “&lt;span style="font-family:courier new;"&gt;--set-output-j2k-ratio&lt;/span&gt; &lt;ratio&gt;”.&lt;/ratio&gt;&lt;/output&gt;&lt;/output&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;span style="font-size:100%;"&gt;&lt;output filename=""&gt;&lt;output filename=""&gt;&lt;ratio&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Creating a Windows batch file&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;If you would like to run your encoding script over a directory of TIFF images (for example), you can create a simple batch file.&lt;br /&gt;&lt;br /&gt;Example:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;for %%f in (*.tif) do j2kdriver.exe --set-input-image-file "%%f" --set-output-j2k-error-resilience ALL --set-output-j2k-progression-order RLCP --set-output-j2k-ratio 8 --tile-size 1024 1024 --output-file-type JP2 --output-file-name "%%~nf.jp2"&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Good luck putting together your own recipes and workflows. Again, please let me know if you have any suggestions for improving our practice.&lt;br /&gt;&lt;br /&gt;[1] SO/IEC  JTC1/SC29 WG1, JPEG 2000 Editor Martin Boliek, Co-editors Charilaos, C.  and E.Majani. "JPEG 2000 Part I Final Committee Draft Version 1.0.",  2000, &lt;a href="http://www.jpeg.org/public/fcd15444-4.pdf"&gt;http://www.jpeg.org/public/fcd15444-4.pdf&lt;/a&gt; (accessed September 3,  2010).&lt;span style="font-style: italic;"&gt;&lt;br /&gt;&lt;br /&gt;Bill Comstock&lt;br /&gt;Head, Imaging Services&lt;br /&gt;Harvard College Library&lt;br /&gt;Widener, D70C&lt;br /&gt;Harvard Yard&lt;br /&gt;Cambridge, MA 02138&lt;br /&gt;&lt;a href="http://imaging.harvard.edu/"&gt;http://imaging.harvard.edu/&lt;/a&gt;&lt;/span&gt;&lt;/ratio&gt;&lt;/output&gt;&lt;/output&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-2283465806206824499?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/2283465806206824499/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=2283465806206824499&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/2283465806206824499'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/2283465806206824499'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/09/guest-post-jpeg2000-recipes-for-aware.html' title='Guest post: JPEG2000 recipes for the Aware encoder'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-3319768918544298017</id><published>2010-08-27T08:22:00.002+01:00</published><updated>2010-08-27T08:26:08.890+01:00</updated><title type='text'>JPEG 2000 for the practitioner - free one-day seminar</title><content type='html'>This recently announced call for papers/registration may be of interest to our readers:&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;JPEG 2000 for the practitioner - a one-day seminar&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;A free seminar to explore and examine the use of JPEG 2000 in the cultural heritage industry will be held at the Wellcome Trust. The seminar will include specific case studies of JPEG 2000 use. It will explain technical issues that have an impact on practical implementation of the format, and explore the context of how and why organisations may choose to use JPEG 2000. Although the seminar will have an emphasis on digitisation and digital libraries, the papers will be relevant to a range of research and creative industries. Places are limited to 80 attendees. Papers will be made available online after the event.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;Tuesday 16 November 2010&lt;br /&gt;&lt;/strong&gt;9am - 5pm&lt;br /&gt;Wellcome Trust, 215 Euston Road, London, UK&lt;br /&gt;&lt;br /&gt;This seminar is hosted by the &lt;a href="http://jp2k-uk.wikidot.com/"&gt;JPEG 2000 Implementation Working Group &lt;/a&gt;and the &lt;a href="http://library.wellcome.ac.uk/"&gt;Wellcome Library&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;Contributors:&lt;/strong&gt; please submit the title and a brief abstract of your proposed paper and a bio of the speaker/s to c.henshaw@wellcome.ac.uk by October 4, 2010.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;Delegates:&lt;/strong&gt; if you would like to attend please email your name and the name of your institution to c.henshaw@wellcome.ac.uk by 1 November, 2010.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-3319768918544298017?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/3319768918544298017/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=3319768918544298017&amp;isPopup=true' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/3319768918544298017'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/3319768918544298017'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/08/jpeg-2000-for-practitioner-free-one-day.html' title='JPEG 2000 for the practitioner - free one-day seminar'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-1498688687175887369</id><published>2010-08-24T13:53:00.002+01:00</published><updated>2010-08-24T13:58:48.347+01:00</updated><title type='text'>Determining rates of JPEG 2000 compression on a collection-by-collection basis</title><content type='html'>As a result of our decision to "go &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_0"&gt;lossy&lt;/span&gt;", we need to make sure that the level of &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_1"&gt;lossiness&lt;/span&gt; is appropriate to the image content. We can't do this on the individual image level, as there are simply too many images. But we can do this on the collection level. We came up with a rule of thumb:&lt;br /&gt;&lt;br /&gt;For any given collection of physical formats we  will apply  a  range of  different compressions on a representative sample  from that collection . We  will continue compressing at regular intervals until visual &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_2"&gt;artefacts&lt;/span&gt; began to appear on any individual image (i.e. 2:1, 4:1, 6:1, and so on).&lt;br /&gt;Once we determined at which compression level the worst-performing image began to show visual &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_3"&gt;artefacts&lt;/span&gt;, we  will choose the next lowest compression level (if the worst-performing image showed &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_4"&gt;artefacts&lt;/span&gt; at 10:1, we would chose 6:1) and apply that to the entire collection, regardless of how much more compression other material types in that collection might bear.&lt;br /&gt;This rule of thumb allowed us to strike a balance between storage savings and the time and effort in assessing compression levels for a large number of images.&lt;br /&gt;&lt;br /&gt;The first "real life" test of this methodology was carried out in relation to our archives digitisation &lt;a href="http://library.wellcome.ac.uk/doc_WTX057852.html"&gt;project&lt;/a&gt;. We are currently digitising a series of paper archives (letters, notebooks, photos, invitations, memos, etc.) in-house. The scope runs to something like half a million images over a couple of years, and includes the papers of some notable individuals and organisations (Francis Crick being the foremost of these). Archives can be quite &lt;span class="blsp-spelling-corrected" id="SPELLING_ERROR_5"&gt;miscellaneous&lt;/span&gt; in the types of things that you find, but different collections within the archives tend to contain a similar range of materials. This presents a problem if you want to treat images differently depending on their content. The photographer doesn't know, from one file of material to the next, what sort of content they will be handling. So even for a miscellaneous collection, once the image count gets high enough, you have to make the compromise by taking a collection-level decision on compression rates.&lt;br /&gt;&lt;br /&gt;For archival collections we needed to test things like faint pencil marks on a notebook page, typescript on translucent letter paper, black and white photos, printed matter, newsprint, colour drawings, and so on. We chose 10 samples for the test. As this was our first test, and we were curious just how far we could go for some of the material types in our sample, we started with 1:1 &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_6"&gt;lossy&lt;/span&gt; compression and increased this to 100:1. We used &lt;a href="http://www.luratech.com/products/imaging-solutions.html"&gt;&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_7"&gt;LuraWave&lt;/span&gt;&lt;/a&gt; for this testing.&lt;br /&gt;&lt;br /&gt;For the archives, the compression intervals were: 1:1 &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_8"&gt;lossy&lt;/span&gt;, 2:1, 4:1, 6:1, 10:1, 25:1, 50:1, and 100:1. The idea is that at 2:1, the compression will reduce the file size by half in comparison to the source TIFF, and so on.&lt;br /&gt;&lt;br /&gt;Not surprisingly, the biggest drop in &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_9"&gt;filesize&lt;/span&gt; was seen in converting from TIFF to &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_10"&gt;JPEG&lt;/span&gt; 2000 in the first place. At a 1:1 compression rate, this reduced the average &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_11"&gt;filesize&lt;/span&gt; by 86% (ranging from 67% to 95%). A 2.1 compression resulted in no &lt;span class="blsp-spelling-corrected" id="SPELLING_ERROR_12"&gt;noticeable&lt;/span&gt; drop in &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_13"&gt;filesize&lt;/span&gt; from 1:1 - begging the question what differences there could possible be between 1:1 and 2.1 in the &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_14"&gt;LuraWave&lt;/span&gt; software. At the average file size  (5&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_15"&gt;mb&lt;/span&gt;)  at this compression  (2:1) , a 500,000 image repository (our estimate for the archives project) would require 2.4 Tb of storage. These averages are somewhat misleading, because while they represent a spread of material, they do not represent the relative proportions of this material in the actual collection as a whole (and we can't estimate that yet).&lt;br /&gt;&lt;br /&gt;File size reduction was relatively minimal between 2:1 and 10:1. What is obvious here is that setting the compression rate at, say, 2:1 does not give you a 2:1 ratio. You can achieve in fact a 14:1 ratio or higher. An interesting point to make about the very high experimental compression rates of 25:1 and above, was that output file sizes were essentially &lt;span class="blsp-spelling-corrected" id="SPELLING_ERROR_16"&gt;homogeneous&lt;/span&gt; across all the images, where as at 10:1 and lower, file sizes ranged from 1.5 Mb to 11.5 Mb.&lt;br /&gt;&lt;br /&gt;TIFF = 35 Mb&lt;br /&gt;1:1/2:1 = 4.96 Mb (86% reduction)&lt;br /&gt;4:1 = 4.56 Mb (87% reduction)&lt;br /&gt;6:1 = 3.89 Mb (89% reduction)&lt;br /&gt;10:1 = 2.87 Mb (92% reduction)&lt;br /&gt;25:1 = 1.39 Mb (96% reduction)&lt;br /&gt;50:1 = 0.72 Mb (98% reduction)&lt;br /&gt;100:1 = 0.37 Mb (99% reduction)&lt;br /&gt;&lt;br /&gt;We found that the most colourful images in the collection (such as a colour photograph of a painting) performed the worst, as expected, and started to show &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_17"&gt;artefacts&lt;/span&gt; at 10:1. These were extremely minor &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_18"&gt;artefacts&lt;/span&gt;, but they could be seen. Other material types were impossible to differentiate from the originals even at 50:1 or 100:1, surprisingly. These tended to be black and white textual items. Using our rule of thumb, we chose 6:1 &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_19"&gt;lossy&lt;/span&gt; compression for the archive collections. Were an archive to consist solely of printed pieces of paper, we would reassess and choose a higher compression rate, but an 89% reduction was highly acceptable in storage savings terms.&lt;br /&gt;&lt;br /&gt;You may ask: why not just use 1:1 across the board? Is the extra saving actually worth it? Viewed in comparison to the 1:1 setting, we were getting a better than 20% reduction at 6:1 on average. This continues to represent a significant storage saving when you consider the ultimate goal is to digitise around 3.5 million images from the archive collections. Bearing in mind all the other collections we plan to digitise in future (up to 30m images), the savings become further magnified if we strive to reduce file sizes within the limits of what is visually acceptable.&lt;br /&gt;&lt;br /&gt;There are a couple of follow-on questions remaining from all this: first, what size original should you begin with? And secondly, is it possible to automate compression using a quality control (such as peak to signal noise ratio) that allows you to compress different images at different rates depending on an accepted level of accuracy. These will be the subject future posts.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-1498688687175887369?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/1498688687175887369/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=1498688687175887369&amp;isPopup=true' title='5 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/1498688687175887369'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/1498688687175887369'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/08/as-result-of-our-decision-to-go-lossy.html' title='Determining rates of JPEG 2000 compression on a collection-by-collection basis'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>5</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-2851759646166770901</id><published>2010-08-13T10:12:00.003+01:00</published><updated>2010-08-13T11:13:16.168+01:00</updated><title type='text'>The JPEG2000 problem for this week</title><content type='html'>JPEG2000 isn’t the easiest of formats to disseminate. Browsers typically handle the format with difficulty and then require plugins or extensions to render the format. We don’t want our users to have to download anything just to be able to view our material on-line. So, we plan to convert our JPEG2000 files to a browser friendly JPEG or PDF for dissemination. Both formats admirably handled by browsers. (OK, PDF needs an Adobe &lt;a href="http://get.adobe.com/uk/reader/otherversions/"&gt;plugin &lt;/a&gt;but it's commonly included with browsers.) Other formats may come along later. The thing is, how do we do that conversion? There are plenty of conversion tools out there – we use &lt;a href="http://www.luratech.com/products/imaging-solutions.html"&gt;Lurawave &lt;/a&gt;for the image conversion. But then the question becomes when do we convert from a master to a dissemination format? Especially if we want a speedy delivery of content to the end user.&lt;br /&gt;&lt;br /&gt;One of the guiding principles behind our decision to use JPEG2000 was that we could reduce our overall storage requirements by creating smaller files than we might have done if we’d used, say, TIFF. So if we automatically convert every JPEG2000 to a low res thumbnail JPEG, a medium res JPEG and a high res JPEG and to a PDF then we’re back to having to find storage for these dissemination files. OK, JPEG won’t consume terabytes of storage and nor will PDF, but we’d need structured storage to keep track of each manifestation and metadata to provide to our front end delivery system as to which JPEG was to be used in which circumstances. True, this has been very successfully done for many projects before now but alongside efficiency of storage is efficiency of managing what we have stored and a speedy delivery.&lt;br /&gt;&lt;br /&gt;So we plan to convert JPEG2000 to JPEG or PDF on-the-fly at the time each image is requested. The idea is that we serve JPEG2000 images out of our DAM to an image server, the image is converted and the dissemination file served up. Instead of paying for large volumes of static storage we believe that putting the saving on storage into a fast image server will directly benefit those who want to use our material online.&lt;br /&gt;&lt;br /&gt;One outcome of a conversation had with &lt;a href="http://www.dlconsulting.com/"&gt;DLConsulting &lt;/a&gt;is that we've learned that on-the-fly conversion is a potentially system intensive (and at worst inefficient) activity that could create a bottleneck in the delivery of content to the end user. We've said that speed is an issue. We need to efficiently process the tiled and layered JPEG2000 files we plan to create. A faster more powerful image server may help but good conversion software qwill be key. Alongside on-the-fly conversion we plan to use a cache that would hold, in temporary storage the most requested images/PDFs. The cache would work something like this. It has a limited size/capacity and contains the most popular/most often requested images/PDFs. If an image/PDF in the cache were not requested for n amount of time it would be removed from the cache. In practice a user requests an digitised image of a painting, the front end system queries the cache to see if the image is there, if it is its served directly and swiftly to the user. If not the front end system calls the file from the back end DAM. The DAM delivers that image to the image server, which converts JPEG2000 to JPEG and places that images in the cache. From where it can be passed to the front end system and the end user. Smooth, fast and efficient in the use of system resources.&lt;br /&gt;&lt;br /&gt;But there are still questions. If we pass the JPEG2000 to the image server for conversion to JPEG that’s fine; but what happens next? Is the JPEG2000 discarded after the conversion process leaving only the JPEGs? Is this the best way to support the zooming in on image sections that we want to offer. The original proposal was to hold only dissemination formats in the cache, now we’re thinking that for flexibility we may prefer to hold the JPEG2000 images and convert them as the image is requested by a user. Is this still the most efficient process? It's easy to build bottlenecks into a system that slow processes down, much more difficult to design a system for speed and efficiency. We’re pretty certain that the conversion–on-the-fly is a good idea and we also think the cache is too. Unless you know differently….&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-2851759646166770901?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/2851759646166770901/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=2851759646166770901&amp;isPopup=true' title='7 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/2851759646166770901'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/2851759646166770901'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/08/jpeg2000-problem-for-this-week.html' title='The JPEG2000 problem for this week'/><author><name>dnt</name><uri>http://www.blogger.com/profile/11218789008554869322</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='21' height='32' src='http://4.bp.blogspot.com/_M2W_1JyTKf8/SQgicvDXBgI/AAAAAAAAAAk/_9wEIMolmas/S220/dnt_digpres_awards_pic_2006.jpg'/></author><thr:total>7</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-1345613924636758992</id><published>2010-07-21T08:54:00.003+01:00</published><updated>2010-07-21T12:22:37.121+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='migration'/><category scheme='http://www.blogger.com/atom/ns#' term='preservation'/><category scheme='http://www.blogger.com/atom/ns#' term='JPEG2000'/><category scheme='http://www.blogger.com/atom/ns#' term='JP2K'/><title type='text'>Future migration of JPEG2000</title><content type='html'>Those of us who work with digital assets know that one day we’ll face format obsolescence. The formats we have in our care will no longer be rendered by the applications that created them or by readily obtainable alternatives. This applies to all formats not just JPEG2000. As a relatively new and untried format planning for the long term management of JPEG2000 will require some work.&lt;br /&gt;&lt;br /&gt;The key challenge with migration as a strategy is not deciding how to do migration but how to identify and maintain the significant properties of the format being migrated. The danger is that some property of the format may be lost during the process. The biggest fear with images being that quality will deteriorate over time. This loss of quality, whilst insignificant in the initial migration, may have a detrimental cumulative and irreversible effect over time.&lt;br /&gt;&lt;br /&gt;So, do we have a plan for the future migration of obsolete JPEG2000 files? No, we do not. We are still trying to develop the specifications for the types of JPEG2000 that we want to use. Beyond the pale or not we have accepted that our images will be lossy. What we are trying to do is create JPEG2000 images that are consistent, have a minimal range of compression ratios and have a few variations in technical specifications as we can provide for. As a start this will make long term management simpler, but we are aware that we still have a way to go.&lt;br /&gt;&lt;br /&gt;Our promotion of JPEG2000 as a format will hopefully make it more widely accepted and therefore the format will attract more research into possible migration options. We’re pleased to see that already individuals and organisations have been thinking about future migration of JPEG2000. The development of tools such as &lt;a href="http://www.planets-project.eu/"&gt;Planets &lt;/a&gt;in recent years has been a great step forward in supporting decision making around the long term management of formats.&lt;br /&gt;&lt;br /&gt;Obsolescence is not something totally beyond our control. We are free to decide when obsolescence actually occurs, when it becomes a problem we need to deal with, and, with proper long term management strategies how we plan to migrate from obsolete formats to current ones. The choice of JPEG2000 as a master format supports this broader approach to data management.&lt;br /&gt;&lt;br /&gt;The long term management of JPEG2000 as a format is part of our overall strategy for the creation of a digital library. Ease of use, the ability to automate processes and the flexibility of JPEG2000 have all been factors in our decision to use the format.&lt;br /&gt;&lt;br /&gt;We’re clear that the choice we have made in the specification of our JPEG2000 images is a pragmatic one. Its also clear that the decision to use JPEG2000 in a lossy format has consequences. However, we have a format that we can afford to store and one that offers flexibility in the way that we can deliver material to end users. For us this balance is important, probably more important than any single decision about one aspect of a formats long term management.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-1345613924636758992?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/1345613924636758992/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=1345613924636758992&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/1345613924636758992'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/1345613924636758992'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/07/future-migration-of-jpeg2000.html' title='Future migration of JPEG2000'/><author><name>dnt</name><uri>http://www.blogger.com/profile/11218789008554869322</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='21' height='32' src='http://4.bp.blogspot.com/_M2W_1JyTKf8/SQgicvDXBgI/AAAAAAAAAAk/_9wEIMolmas/S220/dnt_digpres_awards_pic_2006.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-5783670768881163773</id><published>2010-07-13T09:39:00.001+01:00</published><updated>2010-07-13T09:41:40.812+01:00</updated><title type='text'>Lossy v. lossless compression in JPEG 2000</title><content type='html'>The arguments for and against using &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_0"&gt;JPEG&lt;/span&gt; 2000 &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_1"&gt;lossy&lt;/span&gt; files for long-term preservation are largely centred around two issues: 1) that the original capture image is the true representation of the physical item, and therefore all the information captured at digitisation should be preserved; and 2) that &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_2"&gt;lossy&lt;/span&gt; compression (as opposed to &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_3"&gt;lossless&lt;/span&gt; compression) will permanently discard some of this important information from the digital image. Both of these statements can be challenged, and the Buckley/Tanner &lt;a href="http://library.wellcome.ac.uk/assets/wtx056572.pdf"&gt;report &lt;/a&gt;went some way to doing this.&lt;br /&gt;&lt;br /&gt;The perceived fidelity of the original captured image is the root of the attachment to &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_4"&gt;lossless&lt;/span&gt; image formats. As cameras have improved, so has the volume of information captured in the RAW files. This volume of information has of course improved the visual quality and accuracy of the images, but this comes at the cost of inflated file sizes. A high-end &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_5"&gt;dSLR&lt;/span&gt; camera will produce RAW files of around 12Mb. A RAW file produced by a medium-format camera may be 50Mb or higher. When a RAW file is converted to a TIFF, file sizes can increase dramatically depending on the bit-depth chosen due to interpolating &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_6"&gt;RGB&lt;/span&gt; values for each pixel captured in the RAW file. As RAW files can only be rendered (read) by the proprietary software of the camera manufacturer (which may include &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_7"&gt;plugins&lt;/span&gt; for 3rd party applications like &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_8"&gt;Photoshop&lt;/span&gt;), they cannot be used for access purposes and, being proprietary, are not a good preservation format. They must be converted to a format suited to long term management, and this has usually been TIFF. When a RAW file is converted to a TIFF, file sizes can increase dramatically depending on the bit-depth chosen due to interpolating &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_9"&gt;RGB&lt;/span&gt; values for each pixel captured in the RAW file. This bloats the storage requirements by 2 to 4 times.&lt;br /&gt;&lt;br /&gt;However, image capture and subsequent storage of large images, is expensive, and we don't want to have to &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_10"&gt;redigitise&lt;/span&gt; objects &lt;span style="font-style: italic;"&gt;ever&lt;/span&gt; if we can get away with it - particularly for large scale projects. So, how much of a compromise is &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_11"&gt;lossy&lt;/span&gt; compression, and is it really worth it? The question is: what information are we actually capturing in our digital images? Do we we need all that information? Is any of it redundant?&lt;br /&gt;&lt;br /&gt;First - the visual fidelity issue. Fidelity to what information? The visual appearance of a physical item as defined by one person in a particular light? The visual appearance as &lt;span class="blsp-spelling-corrected" id="SPELLING_ERROR_12"&gt;perceived&lt;/span&gt; through a specific type of lens? All the pixels and colour information contained in the image as captured under particular conditions? No two images taken through the same camera even seconds apart will look the same due to distortions caused by the equipment, and, possibly, noise levels. What makes any particular pixel the &lt;span style="font-style: italic;"&gt;original&lt;/span&gt; representation, or the &lt;span style="font-style: italic;"&gt;most accurate&lt;/span&gt;, or indeed at all &lt;span style="font-style: italic;"&gt;important&lt;/span&gt;?&lt;br /&gt;&lt;br /&gt;&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_13"&gt;Lossy&lt;/span&gt; compression &lt;span style="font-style: italic;"&gt;will&lt;/span&gt; permanently discard data. What is necessary is to determine - for any given object, set of objects, or purpose - what information is actually useful and necessary to retain. We already balance these decisions at the capture stage. Choosing to use a small-format camera immediately limits the amount of information that can be detected by the camera sensor. Choosing one lens over another introduces a slightly different distortion.  Compression also represents a choice between what you can &lt;span class="blsp-spelling-corrected" id="SPELLING_ERROR_14"&gt;capture&lt;/span&gt; and what you actually need. One may not need all the information that has been captured; &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_15"&gt;some&lt;/span&gt; of it may be redundant. A lot of it may be redundant. And the point of &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_16"&gt;JPEG&lt;/span&gt; 2000 is that it is very good at removing &lt;span class="blsp-spelling-corrected" id="SPELLING_ERROR_17"&gt;redundant&lt;/span&gt; information.&lt;br /&gt;&lt;br /&gt;At the &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_18"&gt;Wellcome&lt;/span&gt; Library, the aim of our large-scale digitisation projects is to provide access. We do not want to &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_19"&gt;redigitise&lt;/span&gt; in the future, but we do not see the digital manifestations as the "preservation" objects. The &lt;span class="blsp-spelling-corrected" id="SPELLING_ERROR_20"&gt;physical&lt;/span&gt; item is the preservation copy, whether that is a book, a unique oil painting, or a copy of a letter to Francis Crick. For us, the important information captured in a digital manifestation are the human-visible properties. Images should be clear and in-focus, details visible on the original should be visible in the image (so it must be large enough to see quite small details), colour should be as close to the original as possible in daylight conditions and consistent, and there should be no visible digital &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_21"&gt;artefacts&lt;/span&gt; at 100%. This is the standard for an image as captured.&lt;br /&gt;&lt;br /&gt;We are striking a balance. Can we compress this image and retain all these important qualities? Yes. Do we need to retain information that doesn't have any &lt;span class="blsp-spelling-corrected" id="SPELLING_ERROR_22"&gt;relevance&lt;/span&gt; to these qualities? No. &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_23"&gt;Lossy&lt;/span&gt; compression works for us. Using these qualities as a basis, we set out a testing strategy to determine how much compression our images could withstand.&lt;br /&gt;&lt;br /&gt;To be continued...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-5783670768881163773?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/5783670768881163773/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=5783670768881163773&amp;isPopup=true' title='5 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/5783670768881163773'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/5783670768881163773'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/07/lossy-v-lossless-compression-in-jpeg.html' title='Lossy v. lossless compression in JPEG 2000'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>5</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-7481686637635405801</id><published>2010-07-06T09:50:00.017+01:00</published><updated>2010-07-21T09:08:18.798+01:00</updated><title type='text'>Finding a JPEG 2000 conversion tool</title><content type='html'>It should be stated straight away that we don't have any programming capacity at the Wellcome Library (or the Wellcome Trust, our parent company). We don't do any in-house software development, and we don't use open source software much as a result. When it comes to using and creating the JPEG 2000 file format, this immediately limited our options regarding what tools we could use. Imaging devices do not output JPEG 2000, and even if they did, we would prefer to convert from TIFF to allow us full control over the options and settings. To achieve this, we needed a reliable file conversion utility.&lt;br /&gt;&lt;br /&gt;Richard Clark, as discussed in a previous&lt;a href="http://jpeg2000wellcomelibrary.blogspot.com/2010/06/jpeg-2000-workshop-with-richard-clark.html"&gt; blog post&lt;/a&gt;, presented a number of major players providing tools for converting images to JPEG 2000. Of this list, only two offer a graphical user interface (GUI); these were Photoshop and &lt;a href="http://www.luratech.com/products/imaging-solutions.html"&gt;LuraWave&lt;/a&gt;. The other tools, such as Kakadu, Aware, Leadtools, and OpenJPEG are available as software developer kits (SDKs) or binary files and require development work in order to use them.&lt;br /&gt;&lt;br /&gt;We tested Photoshop and LuraWave with a range of images representing material from black and white text to full-colour artworks. We attempted to set options in both products as closely as possible to the Buckley/Tanner &lt;a href="http://library.wellcome.ac.uk/assets/wtx056572.pdf"&gt;recommendations&lt;/a&gt;. We tested compression levels as well, but this is the subject of a future posting.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Photoshop &lt;/span&gt;first began supporting JPEG 2000 with &lt;span style="font-weight: bold;"&gt;CS2&lt;/span&gt;. The plugin - installed separately from the CD - allows the user to view, edit and save JPEG 2000 files as &lt;a href="http://www.digitalpreservation.gov/formats/fdd/fdd000154.shtml"&gt;jpx/jpf &lt;/a&gt;(extended) files (although these can be made compatible with jp2). That means that although the file is a .jpx, you can open it with programs that only work with jp2. This version provided a number of options: tile sizes, embedding metadata, and so on, but was limited. In &lt;span style="font-weight: bold;"&gt;CS3&lt;/span&gt;, the plugin changed. In this version, the plugin used Kakadu to encode the image, and appeared to create a "proper" jp2 file. This version got us much closer to the Buckley/Tanner recommendation. &lt;span style="font-weight: bold;"&gt;CS4&lt;/span&gt; removed the plugin from the installation altogether, requiring the user to download it from the Photoshop downloads website as part of a batch of "legacy" plugins. &lt;span style="font-weight: bold;"&gt;CS5&lt;/span&gt;, however, now includes the plugin as part of the default install. CS5 became available this summer,  so we have not had a chance to investigate this version of the plugin, but their &lt;a href="http://www.adobe.com/products/photoshop/pdfs/ps_cameraraw_userguide.pdf"&gt;userguide &lt;/a&gt;mentions JPEG 2000 in the final section and as before, saves jpx/jpf files as standard.&lt;br /&gt;&lt;br /&gt;It is good news that Photoshop is now including the plugin as standard. However, as the previous versions of the plugin were so variable, and the implementation so non-standard, it became clear that for the time being use of Photoshop is too risky for a large-scale programme. We need flexibility in setting options, images that conform to a standard, and long-term consistency in the availability of the tool and the options it provides.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;LuraWave&lt;/span&gt;, developed by a German company called &lt;a href="http://www.luratech.com/en/home.html"&gt;LuraTech&lt;/a&gt;, provided the GUI interface we needed, so was the obvious choice for testing. We obtained a demo version and using the wide range of options available we seemed able to meet the Buckley/Tanner recommendations in their entirety. We did, however, come across &lt;span style="font-weight: bold;"&gt;two issues&lt;/span&gt; with this software. &lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;Firstly, we found that with our particular settings (including multiple quality levels and resolution layers, etc.), the software created an anomaly in the form of a small grey box in certain images where a background border was entirely of a single colour (in our case, black). It was reproducible. We immediately notified the suppliers, who investigated the bug, fixed it, and sent us a new version in a matter of days. The grey boxes no longer appeared. &lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;Secondly&lt;span style="font-weight: bold;"&gt;, &lt;/span&gt;when we characterised our converted images with &lt;a href="http://hul.harvard.edu/jhove/"&gt;JHOVE &lt;/a&gt;we found that the encoding was in fact a jpx/jpf wrapped in a jp2 format. We went back to the suppliers who informed us that our TIFFs contained an output &lt;a href="http://en.wikipedia.org/wiki/Color_management"&gt;ICC profile&lt;/a&gt; that was incompatible with their implementation of jp2. The tool was programmed to encode to jpx/jpf when an output ICC profile was detected. This was a bit of a blow - we use Lightroom to convert our raw images to TIFF, and Lightroom automatically embeds an ICC profile. We would either have to strip the ICC profiles from our images before conversion, or the software would need to accommodate us.&lt;br /&gt;&lt;br /&gt;Happily, Luratech were able to re-programme the conversion tool to force jp2 encoding (ignore the ICC profile), with an option to allow it to encode to jpx/jpf if the ICC profile is detected (see screenshot of the relevent options below). We have now purchased this revised version, and will soon be integrating JPEG 2000 conversion into our digitisation workflow. Of course, all this talk of ignoring ICC profiles and so on leads us to some issues around colour space and colour space metadata in JPEG 2000. We also had an interesting experience using JHOVE, that we will talk about soon. Watch this space!&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;UPDATE&lt;/span&gt; &lt;span style="font-weight: bold;"&gt;July 2010&lt;/span&gt;: In order to ignore the ICC profile, an additional command has to be added to the command line, as shown in the following images:&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_hR6lGOqlUv0/TEapYSEGs8I/AAAAAAAAAJs/UvSJr-RJrWM/s1600/L-wave.jpg"&gt;&lt;img style="float: left; margin: 0pt 10px 10px 0pt; cursor: pointer; width: 292px; height: 400px;" src="http://4.bp.blogspot.com/_hR6lGOqlUv0/TEapYSEGs8I/AAAAAAAAAJs/UvSJr-RJrWM/s400/L-wave.jpg" alt="" id="BLOGGER_PHOTO_ID_5496266629883278274" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;span style="font-weight: bold;"&gt;&lt;a onblur="try  {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_hR6lGOqlUv0/TDRKRhJwxwI/AAAAAAAAAJM/6z4D8f2E9vc/s1600/L-wave.jpg"&gt;&lt;br /&gt;&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_hR6lGOqlUv0/TEapkTwuBRI/AAAAAAAAAJ0/ESFJ_2F5Ma0/s1600/LuraWave_new_command.jpg"&gt;&lt;img style="float: left; margin: 0pt 10px 10px 0pt; cursor: pointer; width: 400px; height: 72px;" src="http://4.bp.blogspot.com/_hR6lGOqlUv0/TEapkTwuBRI/AAAAAAAAAJ0/ESFJ_2F5Ma0/s400/LuraWave_new_command.jpg" alt="" id="BLOGGER_PHOTO_ID_5496266836497270034" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-7481686637635405801?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/7481686637635405801/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=7481686637635405801&amp;isPopup=true' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/7481686637635405801'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/7481686637635405801'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/07/finding-jpeg-2000-conversion-tool.html' title='Finding a JPEG 2000 conversion tool'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_hR6lGOqlUv0/TEapYSEGs8I/AAAAAAAAAJs/UvSJr-RJrWM/s72-c/L-wave.jpg' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-6805989059406769103</id><published>2010-06-25T09:58:00.003+01:00</published><updated>2010-06-25T10:18:37.615+01:00</updated><title type='text'>JPEG 2000 workshop with Richard Clark</title><content type='html'>In the wake of taking on board the recommendations from the Buckley/Tanner report (see a previous blog &lt;a href="http://jpeg2000wellcomelibrary.blogspot.com/2010/06/bringing-in-experts.html"&gt;post&lt;/a&gt;), we needed to start looking at how we would actually create these JPEG 2000s as part of our digitisation workflow. As the JP2K-UK group meeting showed us, there is not a lot of knowledge in our industry regarding to the tools we could use - not only for creating the JPEG 2000s in the first place, but also for managing, displaying and converting them back into other (browser-friendly, for example) formats. We knew of a few tools, but wanted a more thorough understanding of the possibilities.&lt;br /&gt;&lt;br /&gt;We turned to software engineer Richard Clark, who was deeply involved in the JPEG Committee and has worked on the JPEG 2000 technology. Richard is based in the UK, and currently owns &lt;a href="http://www.elysium.ltd.uk/index.xalter"&gt;Elysium Ltd.&lt;/a&gt;, offering software and IT support solutions to businesses and organisations. Richard Clark was asked to deliver a half-day workshop for those Wellcome Library staff that would be involved in implementing our JPEG 2000 solution.&lt;br /&gt;&lt;br /&gt;The workshop focused on options for the practical implementation of JPEG 2000, and the situation regarding software support for the format. He also touched on the workflow issues we need to be aware of and address in planning our strategy. The workshop helped us determine which solution would work best for us - as will be described in subsequent posts on this blog. You can read a version of his presentation as embedded in this post. Richard also shared with us some of the more technical details from his &lt;a href="http://www.scribd.com/doc/33536526/JPEG-and-JPEG-2000-Past-Present-and-Future"&gt;presentation &lt;/a&gt;at the British Library in 2007, available on Scribd.&lt;br /&gt;&lt;br /&gt;&lt;a title="View J2K Workshop for the Wellcome Library on Scribd" href="http://www.scribd.com/doc/33536412/J2K-Workshop-for-the-Wellcome-Library" style="margin: 12px auto 6px; font: 14px Helvetica,Arial,Sans-serif; display: block; text-decoration: underline;"&gt;J2K Workshop for the Wellcome Library&lt;/a&gt; &lt;object id="doc_94168735522504" name="doc_94168735522504" type="application/x-shockwave-flash" data="http://d1.scribdassets.com/ScribdViewer.swf" style="outline: medium none;" rel="media:presentation" resource="http://d1.scribdassets.com/ScribdViewer.swf?document_id=33536412&amp;amp;access_key=key-2orhaleqonl1hx56y8lc&amp;amp;page=1&amp;amp;viewMode=slideshow" media="http://search.yahoo.com/searchmonkey/media/" dc="http://purl.org/dc/terms/" height="500" width="100%"&gt; &lt;param name="movie" value="http://d1.scribdassets.com/ScribdViewer.swf"&gt; &lt;param name="wmode" value="opaque"&gt; &lt;param name="bgcolor" value="#ffffff"&gt; &lt;param name="allowFullScreen" value="true"&gt; &lt;param name="allowScriptAccess" value="always"&gt; &lt;param name="FlashVars" value="document_id=33536412&amp;amp;access_key=key-2orhaleqonl1hx56y8lc&amp;amp;page=1&amp;amp;viewMode=slideshow"&gt; &lt;embed id="doc_94168735522504" name="doc_94168735522504" src="http://d1.scribdassets.com/ScribdViewer.swf?document_id=33536412&amp;amp;access_key=key-2orhaleqonl1hx56y8lc&amp;amp;page=1&amp;amp;viewMode=slideshow" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" wmode="opaque" bgcolor="#ffffff" height="500" width="100%"&gt;&lt;/embed&gt; &lt;/object&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-6805989059406769103?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/6805989059406769103/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=6805989059406769103&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/6805989059406769103'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/6805989059406769103'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/06/jpeg-2000-workshop-with-richard-clark.html' title='JPEG 2000 workshop with Richard Clark'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-8991607672458410288</id><published>2010-06-22T22:41:00.005+01:00</published><updated>2010-06-22T23:24:49.084+01:00</updated><title type='text'>Initiating the JP2K-UK Implementation Working Group</title><content type='html'>By Autumn 2009, we were committed to using the JP2 format for our digitisation projects. However, we knew that the lack of good information and communication between practitioners was a risk factor. First of all, we wanted to know what was going on so we didn't have to keep re-inventing the wheel. Has anyone else carried out compression tests on historic materials? Who uses which tools, and why? Secondly, if we could improve communications, presumably more people would feel comfortable about using JPEG 2000 serving to broaden the user base and further entrench the format into practice - essential to ensuring longevity.&lt;br /&gt;&lt;br /&gt;This information wasn't just going to come out of the woodwork - or not as quickly as we would have liked - so we set up the "JP2K-UK Implementation Working Group", with a starting membership of one. We then cold-called a number of contacts from relevent organisations to test the level of interest in joining such a group. We optimistically booked a small meeting room, with a free lunch as an added temptation, hoping someone would be interested.&lt;br /&gt;&lt;br /&gt;Someone was indeed interested; in fact, nearly every single person or organisation we contacted had a high level of interest in JPEG 2000, and most were actively pursuing JPEG 2000 implementation in some way as a practitioner, consultant or software developer. We booked a much bigger room, shelled out for a lot more sandwiches, and realised we needed a proper agenda. At this point we set up the &lt;a href="http://jp2k-uk.wikidot.com/"&gt;JP2K-UK wiki&lt;/a&gt; to consolidate online resources, provide dates of any events, and list the member organisations.&lt;br /&gt;&lt;br /&gt;Our first meeting was held in December, and we had 17 initial individual members representing 12 organisations, drawn primarily from the library world (see the wiki for member organisations). The bulk of the meeting was taken up with small groups, discussing what they knew of the different technical aspects of JPEG 2000 (formats and features, compression, IPR, and tools) where the knowledge gaps were, what the general opinion of JPEG 2000 was, and how we might act to make the use of JPEG 2000 a little bit easier for everyone.&lt;br /&gt;&lt;br /&gt;Not surprisingly, there was a range of opinion, levels of knowledge and understanding, and intended use of the format; but every attendee was keen to work toward creating a resource for practitioners and disseminating information with a series of workshops, seminars and/or conferences. As an initial discussion the meeting set the tone for the future of the group, and further developments will be posted here very soon.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-8991607672458410288?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/8991607672458410288/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=8991607672458410288&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/8991607672458410288'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/8991607672458410288'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/06/initiating-jp2k-uk-implementation.html' title='Initiating the JP2K-UK Implementation Working Group'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-4562348612532696258</id><published>2010-06-18T18:28:00.012+01:00</published><updated>2010-06-19T22:52:51.974+01:00</updated><title type='text'>Bringing in the experts</title><content type='html'>There are numerous articles, reviews, and technical reports on the JPEG 2000 format, many free to view online. Despite this, we found it difficult to determine how we could make best use of the format in a practical way. There are 13 "parts" to JPEG 2000 - from basic image formats to a metadata format, and even a digital cinema format. Mostly these parts are extensions to the core specification. Through our own reading, we were able to determine that Parts 1 and 2 were the ones we needed to look at. But which one to use? &lt;a href="http://www.jpeg.org/jpeg2000/j2kpart1.html"&gt;Part 1&lt;/a&gt; specifies both a compression algorithm, and a format. &lt;a href="http://www.jpeg.org/jpeg2000/j2kpart2.html"&gt;Part 2&lt;/a&gt; specifies a different algorithm, and extensions to the format. We could find little - short of becoming a technical expert - that would allow us to adequately weigh up the pros and cons of the various options, and even less on how others have made their decisions.&lt;br /&gt;&lt;br /&gt;In Spring 2009 we turned to Simon Tanner, Director of &lt;a href="http://www.digitalconsultancy.net/"&gt;Kings Digital Consultancy Services&lt;/a&gt;, for some advice. Simon agreed to search out the experts and provide us with a report setting out clear recommendations: primarily which format and compression to use for preservation and access, and what features we should implement. We provided him with a brief of our requirements, the background to our intended digitisation activities, and some sample images.&lt;br /&gt;&lt;br /&gt;Simon did find an expert to work on the report: Robert Buckley, colour digital imaging expert and member of the JPEG Committee. Rob carried out a number of tests on the images we supplied looking at the implications of lossless v. lossy compression, how we might get the best out of certain JPEG 2000 features, how we should manage technical metadata, and more. This provided the evidence, set out in the report, that backed up his final recommendations.&lt;br /&gt;&lt;br /&gt;The key recommendation was that we use the &lt;strong&gt;Part 1 compression and JP2 format&lt;/strong&gt; for our digitisation projects, for both the archival master format as well as the access copy. Also important was the recommendation that we use a lossy rather than a lossless format - maintaining a high quality that could be considered "&lt;strong&gt;visually lossless&lt;/strong&gt;". Although this results in a loss of information that is non-recoverable, the data that is lost was never visible to the human eye, and therefore simply unnecessary for our needs. The Wellcome Library intends to follow the recommendations as closely as possible for future digitisation projects, although exact compression levels used would need to be determined on a collection-by-collection basis with further tests.&lt;br /&gt;&lt;br /&gt;The &lt;a href="http://library.wellcome.ac.uk/assets/wtx056572.pdf"&gt;report &lt;/a&gt;is available to view on our website.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-4562348612532696258?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/4562348612532696258/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=4562348612532696258&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/4562348612532696258'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/4562348612532696258'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/06/bringing-in-experts.html' title='Bringing in the experts'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3808045742964712463.post-5273421595630985082</id><published>2010-06-16T18:44:00.009+01:00</published><updated>2010-06-19T23:05:56.917+01:00</updated><title type='text'>We need how much storage?</title><content type='html'>In 2009, the &lt;a href="http://library.wellcome.ac.uk/"&gt;&lt;span id="SPELLING_ERROR_0" class="blsp-spelling-error"&gt;Wellcome&lt;/span&gt; Library&lt;/a&gt; set out an ambitious vision to digitise a large proportion of its historic collections. This would take the annual digitisation activities of the Library from hundreds, or at most, thousands of images per year to several million images per year. Collections were to include a wide range of content types - archives, printed books from the 15&lt;span id="SPELLING_ERROR_1" class="blsp-spelling-error"&gt;th&lt;/span&gt; to the 20&lt;span id="SPELLING_ERROR_2" class="blsp-spelling-error"&gt;th&lt;/span&gt; century, manuscripts, paintings and drawings, ephemera. Once we added up all these collections, using broad estimates of what we believed was there, we realised this could see the generation of up to 30m images over 5 years. Exciting, but perhaps slightly daunting, considering we didn't yet have an infrastructure to fully support such a large collection of digital assets.&lt;br /&gt;&lt;br /&gt;Anyone reading this blog will understand why the scale of the programme is key to the blog topic. When we asked our IT department to tell us how much it would cost to store 30m TIFF files - our &lt;span id="SPELLING_ERROR_3" class="blsp-spelling-error"&gt;de&lt;/span&gt; facto standard for the couple hundred thousand images in our existing &lt;a href="http://images.wellcome.ac.uk/"&gt;picture library&lt;/a&gt; - we were stunned. Two petabytes of online, spinning disk storage with a top-of-the-line enterprise management system and remote backup would cost &lt;em&gt;how much?&lt;/em&gt; We learned that the cost would be something like a fifth of our total budget for the entire digitisation programme.&lt;br /&gt;&lt;br /&gt;Should we consider a lower-cost storage solution? Even tape back-up was quite expensive for that scale, and you can't serve images up online from tape anyway. We revised our image sizes, factoring in smaller and smaller resolutions and/or bit depths for material like the printed books, which didn't need full colour, high resolution images. We still couldn't afford the storage costs.&lt;br /&gt;&lt;br /&gt;Finally, we saw the light and started looking into a relatively new image format called &lt;a href="http://en.wikipedia.org/wiki/JPEG_2000"&gt;&lt;span id="SPELLING_ERROR_4" class="blsp-spelling-error"&gt;JPEG&lt;/span&gt; 2000&lt;/a&gt;. We knew almost nothing about it, except that it employed an extremely &lt;span id="SPELLING_ERROR_5" class="blsp-spelling-corrected"&gt;efficient&lt;/span&gt; compression algorithm that could, possibly, allow us to reduce our storage costs without compromising too much on quality.&lt;br /&gt;&lt;br /&gt;This was the start of our journey into the complicated and mystifying world of &lt;span id="SPELLING_ERROR_6" class="blsp-spelling-error"&gt;JPEG&lt;/span&gt; 2000. This blog charts our progress up to date in &lt;span id="SPELLING_ERROR_7" class="blsp-spelling-corrected"&gt;determining&lt;/span&gt; what type of &lt;span id="SPELLING_ERROR_8" class="blsp-spelling-error"&gt;JPEG&lt;/span&gt; 2000 we would use, how we would use it, and how it would impact on the rest of the Digital Library infrastructure. We have by no means worked out all the details around how we are going to implement &lt;span id="SPELLING_ERROR_9" class="blsp-spelling-error"&gt;JPEG&lt;/span&gt; 2000, so this blog will also serve as a diary of our progress as we go along. Happy reading, and feel free to post comments.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3808045742964712463-5273421595630985082?l=jpeg2000wellcomelibrary.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://jpeg2000wellcomelibrary.blogspot.com/feeds/5273421595630985082/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3808045742964712463&amp;postID=5273421595630985082&amp;isPopup=true' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/5273421595630985082'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3808045742964712463/posts/default/5273421595630985082'/><link rel='alternate' type='text/html' href='http://jpeg2000wellcomelibrary.blogspot.com/2010/06/we-need-how-much-storage.html' title='We need how much storage?'/><author><name>Christy Henshaw</name><uri>http://www.blogger.com/profile/13179015500410216822</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry></feed>
