Data Management News

Data Management News

  • 5/23/2013
    We're seeking your opinion on our new draft checklist for data management planning, produced as part of the development of our suite of data management planning tools and resources. The new checklist is simpler and is accompanied by a set of themes which (amongst other things) will help to relate guidance from multiple sources to particular questions dealt with in a plan. We're looking for feedback by June 3rd; read the full article for details and the documents themselves.

    Read more

  • 5/21/2013

    At DataCite we aim to be as open as possible, facilitating access to our resources from third parties. In this context we would like to present three external tools that might be useful.

    When we want to assign a DOI to a new dataset it is sometimes complex to create the XML metadata required. Christian Pietsch, from the Bielefeld University Library (Germany) prototyped a DataCite metadata creator in a Python web framework:

    https://github.com/pietsch/DataCite-Metadata-Creator-Django

    You can contact Christian at christian.pietsch+datacite@uni-bielefeld.de.

    Raffaele Messuti, from the University of Bologna (Italy) has created a minimal webapp which given a summary page of a document or data repository and its OAI-PMH endpoint gets the OAI_DC record, apply an XSL stylesheet to convert the schema to DataCite, and call the API to mint the DOI. You can find it in:

    https://github.com/atomotic/oaidc-datacite-web

    You can contact Raffaele at raffaele.messuti2@unibo.it.

    Marcin Paluch, from CISTI (National Research Council Canada) has developed an HTML form for generating the mandatory metadata elements. Although it's rather basic it's also light weight and doesn't require any server back end. This tool could also be expanded to add other elements and even to do minting with CORS support on the DataCite servers.

    You can download the code from here:

    https://dl.dropboxusercontent.com/u/24348767/datacite_meta_gen.html

    You just need to save it to your computer and open it up in a web browser. You'll see that you can start creating metadata right away without the need for any web hosting services or an application server.

    You can contact Marcin at Marcin.Paluch@nrc-cnrc.gc.ca.

    Tanya Gray and David Shotton, from the University of Oxford (UK) have created a DataCite Metadata Input Form. This tool allows you to load an existing DataCite Metadata Report or just start typing to create a new DataCite Metadata Report. It will create for you a single DataCite Metadata Report as an XML document containing the metadata entered. Another possible output format from the tool is DataCite metadata in RDF. You can find it at:

    http://www.miidi.org:8020/datacite/

    You can contact David at david.shotton@zoo.ox.ac.uk and Tanya at tanya.gray@bodleian.ox.ac.uk.

    David has also created, together with Silvio Peroni of the University of Bologna the mapping of the DataCite metadata kernel to RDF, which is used by the DataCite Metadata Input Form to create RDF output. This tool is described at

    http://semanticpublishing.wordpress.com/2012/12/04/revising-datacite2rdf...

    You might also want to read the preceding and following posts in the blog.

    If you want to contact Silvio you can do so at essepuntato@cs.unibo.it.

    Do you have any tool you would like us to comment at the DataCite website? Please, let us know: Sergio.Ruiz@datacite.org.

  • 5/20/2013
    The 4C project is issuing an open consultation to find people interested in clarifying the costs of curation for their organisation. It consists of an online questionnaire that will start a communication channel for further engagement.  

    Read more

  • 5/14/2013

    Today the 3TU.Datacentrum and DANS are kicking off the partnership Research Data Netherlands. With this coalition, which is also open to other parties, the two data archives want to join forces in the field of sustainable data archiving.

    3TU.Datacentrum and DANS have been working together for some time, including the organization of the biennial Dutch Data Award and the course Data Intelligence 4 Librarians. The establishment of the coalition strengthened this cooperation, says DANS director Peter Doorn: "Research Data Netherlands brings us a step further towards our shared mission: promoting sustainable accessibility and sound reuse of scientific research. This ultimately benefits all of science and the humanities". Wilma van Wezenbeek, the TU Delft Library director and Secretary of 3TU.Datacentrum, also welcomed the initiative: "It's a logical step to work together because we can all address issues around research discipline and settings across different approaches in a much more efficient and customer-friendly way".

    3TU website: http://datacentrum.3tu.nl/

    DANS website: http://www.dans.knaw.nl/

    More information (in Dutch) at http://dans.knaw.nl/en/node/2668.

  • 5/9/2013

    Under the terms of the Executive Order and a new Open Data Policy released today by the Office of Science and Technology Policy and the Office of Management and Budget, all newly generated government data will be required to be made available in open, machine-readable formats, greatly enhancing their accessibility and usefulness, while ensuring privacy and security.

    See: http://www.whitehouse.gov/blog/2013/05/09/landmark-steps-liberate-open-data

  • 5/8/2013
    Research Data Netherlands partnership launched

    Read more

  • 4/30/2013
    Brief report on a webinar given by Joy Davidson and Sarah Jones to the LIASA HELIG community. Photo of South African flag by Mister-E: http://www.flickr.com/photos/mister-e/3050826409

    Read more

  • 4/18/2013
    Details of a 3 hour training session for librarians developed in collaboration with the University of Northampton as part of the Institutional Engagement programme. The course slides and handbook are CC-BY licensed for reuse.

    Read more

  • 4/12/2013

    Again it is time to present you the most popular datasets of last month,
    based on http://stats.datacite.org . So here are the top 10 datasets of March , registered by DataCite.

    It is worth mentioning one reason that again most of the top 10 come from figshare. Whereas data published in classical data repositories is usually accessed directly through a search at the data center without using the DOI resolution, publishing the DOI name is the ideal advertising for any figshare content, thus increasing the resolution counts. That is one reason, why figshare DOI names seem to be so popular. Nevertheless the resolution number provide an interesting metric for usage of datasets.

    Number 1: 7610 resolutions
    Figure 7 raw data: Effect of variable exposure to PTHrP (1-36) on bone nodules and AP activity in high plating density cultures. (2013)
    Suzan Kamel, John Yee.
    http://dx.doi.org/10.6084/m9.figshare.154685

    Number 2: 7280 resolutions
    Figure 6 raw data: Effect of intermittent and continuous exposure to PTHrP (1-36) on cell proliferation and apoptosis in RC cell cultures. (2013)
    Suzan Kamel, John Yee.
    http://dx.doi.org/10.6084/m9.figshare.154684

    Number 3: 6433 resolutions
    GenoCAD Tutorial I. (2013)
    Mary Mangan, Mandy Wilson, Laura Adam, Jean Peccoud.
    http://dx.doi.org/10.6084/m9.figshare.153827

    Number 4: 3107 resolutions
    GenoCAD Legacy Grammars. (2013)
    Jean Peccoud, Michael Czar, Yizhi Cai.
    http://dx.doi.org/10.6084/m9.figshare.157107

    Number 5: 2129 resolutions
    Introduction to the UCSC Genome Browser. (2012)
    Mary Mangan.
    http://dx.doi.org/10.6084/m9.figshare.96258

    Number 6: 1739 resolutions
    Supplementary material for the Paper "Perception of Focused Sources in Wave Field Synthesis". (2013)
    Hagen Wierstorf.
    http://dx.doi.org/10.6084/m9.figshare.641440

    Number 7: 735 resolutions
    H. Mosser; Krems/AT (2013)
    C-0364 Breast cancer screening in Austria: online survey on women's level of information, anxiety and trust in information sources in order to assess the extent of informed consent for screening.
    10.1594/ecr2013/C-0364

    Number 8: 601 resolutions
    The case for open preprints in biology. (2013)
    Philippe Desjardins-Proulx, Ethan P. White, Joel Adamson, Karthik Ram, Timoth��e Poisot, Dominique Gravel. .
    http://dx.doi.org/10.6084/m9.figshare.655710

    Number 9: 562 resolutions
    Designing command-line interfaces (CLIs) for scientific software. (2013)
    Daniel Standage.
    http://dx.doi.org/10.6084/m9.figshare.643388

    Number 10: 532 resolutions
    Open Access Now! Research notes in the form of a deck of slides assembled by Ernesto Priego for the Open Access debate organised by Roger Sabin at Central Saint Martins, University of the Arts, London, Monday 18 March 2013. (2013)
    Ernesto Priego.
    http://dx.doi.org/10.6084/m9.figshare.654622

    So the top 10 are 5 datasets, 1 medical study, 2 presentations and 2 textual objects.

    Generally in March we had around 750,000 resolutions in total.

  • 3/29/2013
    The Digital Preservation Coalition (DPC) and Digital Curation Centre (DCC) are delighted to announce a new issue of our joint newsletter ‘What’s New’.

    Read more

  • 3/26/2013

    The Spring 2013 issue of DataONENews is now available. Features Include:

    • Report from Bill Michener discussing the recent Whitehouse memo: Increasing Access to the Results of Federally Funded Scientific Research
    • Update on Version 1.1.0 of the DataONE Cyberinfrastructure
    • DUG 2013 Meeting information and agenda
    • Community Engagement and Education Working Group focus article
    • Outreach Update
    • ONER and ONEMercury highlights
    • ORNL DAAC Member Node profile

    The newsletter can be downloaded here.

  • 3/25/2013
    The EC has launched a major new initiative to help organizations invest confidently in digital curation and preservation.  4C Project - ‘the Collaboration to Clarify the Costs of Curation’ - will help organisations across Europe to invest more effectively in digital curation and preservation. 

    Read more

  • 3/25/2013
    The DCC is pleased to announce a number of new resources. The latest guide in our How-to series - How to Develop RDM Services - a guide for HEIs - and three associated case studies.

    Read more

  • 3/21/2013

    Inspired by conversations with potential DataCite clients, the British Library���s Datasets Team has produced two videos intended to demystify the inner workings of the Metadata Store.

    Over the past nine months, the British Library has hosted a series of workshops designed to support data citation and management in the UK research community. Funded by JISC, these workshops have been attended by a range of stakeholders from the UK Higher Education sector; from librarians and data curators to IT staff and academics, and have covered key topics relevant to the management of research data. Over the course of the workshop series we realised that, although many institutions were very keen to adopt DataCite DOIs for their own data holdings, there was a lot of uncertainty about exactly how the process of DOI ���minting��� worked.

    So, in an attempt to demonstrate just how easy it is to use the service, we have made two videos which illustrate the basic functions of the Metadata Store: 1. Minting a DOI and uploading metadata, and 2. Updating an existing DOI. Both videos can be viewed on the British Library website at http://bit.ly/DataCiteFAQ. Although created with British Library DataCite clients in mind, we hope that other DataCite users, or potential users, might find it useful.

    For more information about our workshops and other data-related activity at the British Library, please visit http://bit.ly/workshoparchive

  • 3/18/2013
    Issue 56 of the Society of College, National and University Libraries (SCONUL) Focus newsletter contains a number of intersting pieces relating to research data management.

    Read more

  • 3/13/2013Thomas Padilla writer Thomas Padilla by Thomas Padilla published by Thomas Padilla

    On Friday March 29, from 12:00PM - 5:00PM, CIRSS faculty and students will join GSLIS colleagues to present and share their research in a series of posters, presentations, and demonstrations. University of Illinois Vice Chancellor for Research Peter Schiffer will open the 2013 Research Showcase. The Research Showcase is an annual event open to campus and the general public.

    The full program and event location can be found here: http://www.lis.illinois.edu/research/showcase.

    See below for a list of CIRSS presentations, posters, and demonstrations.


    Presentations

    HathiTrust Research Center: New Frontiers in Digital Scholarship
    J. Stephen Downie, Craig Willis & Kahyun Choi

    Site-Based Data Curation at Yellowstone National Park
    Carole Palmer, Bruce Fouke, Ann Rodman, Sayeed Choudhury, Andrea Thomer, Karen Baker, Abby Asangba & Karen Wickett

    GSLIS at the Text REtrieval Conference—Retrieving and Filtering Real-Time Data
    Miles Efron

    Posters

    Identifying Claims in Social Science Literature
    Shameem Ahmed, Catherine Blake, Kate Williams, Noah Lenstra & Qiyuan Liu

    Describing the Quality of Research Datasets Across Disciplines: A Comparative Study
    Tiffany C. Chao

    Sustainable Software
    Craig Evans & Jerome McDonough

    On the Effect of Name Ambiguity on Measures of Large-Scale Co-Authorship Networks
    Brent D. Fegley & Vetle I. Torvik

    Enhancing Cultural Heritage Collections by Supporting and Analyzing Participation in Flickr
    Jacob Jett, Megan Senseney & Carole L. Palmer

    Location-Based Navigation: Combining OPAC Searching and 3D Visualization in a High-Density Storage Facility
    Fredrick Kiwuwa Lugya & Michael B. Twidale

    Site-Based Data Curation at Yellowstone National Park
    Carole L. Palmer, Bruce Fouke, Ann Rodman, Sayeed Choudhury, Andrea Thomer, Karen Baker, Abby Asangba & Karen Wickett

    When You Wish upon a Blog: How Collaborative Information Seeking can Interleave with CSCW
    Aiko Takazawa & Michael B. Twidale

    Completeness, Coverage, & Equivalence in Scientific Data Records
    Andrea Thomer

    Extending the Systematic Assertion Model for Humanities Research
    Karen Wickett, David Dubin, Bridget Almas & Megan Senseney

    Center for Informatics Research in Science and Scholarship
    Carole L. Palmer, Director

    HathiTrust Research Center
    J. Stephen Downie, Director

    Demonstrations

    The Illinois Distributed Museum Project: Engineering and Technology Innovations at the University of Illinois at Urbana Champaign
    Michael B. Twidale, Susan Frankenberg, Tom Ackerman & Kelsey Heffren

    Exploiting Structural Data for Music Exploration
    Craig Willis, J. Stephen Downie, Kahyun Choi & David Bainbridge

  • 3/13/2013Thomas Padilla writer Thomas Padilla by Thomas Padilla published by Thomas Padilla

    Professor of Library and Information Science and CIRSS Director, Carole Palmer, will deliver a presentation titled, "Setting Priorities for Data Curation Research and Education", at the University of North Carlolina at Chapel Hill School of Information and Library Science on March 20, 2013.

    Abstract:
    "As our data curation research and education initiatives become more established, we are striving to increase impact on practice by sharpening the focus of our programs and building strategic partnerships. The theme of “reuse value” is now firmly at the center of our studies and extended through the core curriculum in our emphases on research cultures, collections, and representation. Moreover, our projects and programs are increasingly dependent on contributions from domain researchers, data centers, repository developers, and practitioners. I will present an overview of our investigations of data practices across more than a dozen sub-disciplines in the earth and life sciences, and our work on aggregating and modeling cultural heritage collections with developers of the Europeana Data Model. I will also introduce two important projects that represent our next phase of development:  a research collaboration to develop site-based curation principles and processes with geobiologists and resource managers at Yellowstone National Park, and an education initiative providing core curriculum and student field experiences in partnership with the National Center for Atmospheric Research."

  • 3/12/2013Thomas Padilla writer Thomas Padilla by Thomas Padilla published by Thomas Padilla

    CIRSS Associate Director Catherine Blake, Assistant Professor Kate Williams, doctoral students Shameem Ahmed and Noah Lenstra, and masters student Qiyuan Liu received an Honorable Mention Award at iConference 2013 for their poster "Identifying Claims in Social Science Literature". The poster demonstrated application of Professor Blake's Claim Framework to social science literature. 

    While the Claim Framework was initially developed to capture how scientists communicate findings from an empirical study this was the first application to social science literature. The group explored the degree to which the Claim Framework could capture claims made in two social science research areas: Community Informatics and Information and Communication Technologies for Development.

    The poster is available here: http://hdl.handle.net/2142/42061

  • 3/12/2013
    A new Jisc funded study on the scholarly practices of academic chemists has been released by Ithaka S+R . The study covers themes such as data management, research collaboration, library use, discovery, publication practices, and research funding.

    Read more

  • 3/5/2013

    Following the success of publishing the most resolved datasets of January, we present you the
    10 most popular datasets of February 2013, registered by DataCite:

  • Number 1: 4226 resolutions
    GenoCAD Training Set I. (2013)
    Jean Peccoud, Laura Adam, Mandy Wilson
    http://dx.doi.org/10.6084/m9.figshare.153827
  • Number 2: 1863 resolutions
    Introduction to the UCSC Genome Browser. (2012)
    Mary Mangan.
    http://dx.doi.org/10.6084/m9.figshare.96258
  • Number 3: 1560 resolutions
    GenoCAD Legacy Grammars. (2013)
    Jean Peccoud, Michael Czar, Yizhi Cai.
    http://dx.doi.org/10.6084/m9.figshare.157107
  • Number 4: 761 resolutions
    Slagvaardig met ICT: Ontwerpprincipes voor leeromgevingen die professionele digitale competenties van hbo-studenten versterken. (2013)
    F.W. Jacobs
    http://dx.doi.org/10.4233/uuid:41e00c81-e909-4301-a80e-9da23327e3f0
  • Number 5: 670 resolutions
    How to design a genetic mating scheme: a basic training package for Drosophila genetics. (2013)
    John Roote, Andreas Prokop.
    http://dx.doi.org/10.6084/m9.figshare.106631
  • Number 6: 533 resolutions
    2nd year Drosophila Developmental Genetics practical. (2013)
    Andreas Prokop.
    http://dx.doi.org/10.6084/m9.figshare.156395
  • Number 7: 528 resolutions
    Agrawal A, Johnson MTJ, Hastings AP, Maron JL (2012)
    Data from: A field experiment demonstrating plant life-history evolution and its eco-evolutionary feedback to seed predator populations.
    Dryad Digital Repository.
    http://dx.doi.org/10.5061/dryad.7h8n3
  • Number 8: 484 resolutions
    ICT4S ICT for Sustainability. (2013)
    Hilty, Lorenz M., Aebischer, Bernard, Andersson, G��ran and Lohmann, Wolfgang.
    Proceedings of the First International Conference on Information and Communication Technologies for Sustainability, ETH Zurich, February 14-16, 2013
    http://dx.doi.org/10.3929/ethz-a-007337628
  • Number 9: 450 resolutions
    git repository for paper on git and reproducible science. (2013)
    Karthik Ram.
    http://dx.doi.org/10.6084/m9.figshare.155613
  • Number 10: 439 resolutions
    Evaluating a general theory of macroecology. (2013)
    Ethan P. White
    http://dx.doi.org/10.6084/m9.figshare.155707

    So the top 10 are 4 datasets, 1 course material, 2 presentations and 3 textual objects, although number 9 could propably count as data about a text-object :-)
    Generally we again had around 500,000 resolutions in total.
    Interestingly none of the top 11 from January is in the top 10 of February, which indicates that the resolutions here come from actual interest in the objects, not just from clicking on the top 11 of January...

  • 3/4/2013

    The Public Participation in Science Research working group today released their Data Management Guide designed specifically for citizen scientists.

  • 2/27/2013

    The DataONE team is happy to announce the release of the first version of the DataONE R Client, our package for accessing open data in DataONE from within the R environment for statistical computing. The DataONE R Client provides the ability to access open ecological, environmental, and earth science data from the DataONE network of repositories, and to save data from within R to DataONE repositories that support write access.

  • 2/20/2013

     

    The Open Annotation Collaboration (OAC) recently announced three public meetings to introduce the Open Annotation Data Model Community Specification. With generous support from the Andrew W. Mellon Foundation, OAC project members will use the meetings to inform digital humanities and sciences computing developers, curators of digital collections and scholars using digital content about the W3C Open Annotation Community Group’s work. OAC Principle Investigator and CIRSS Affiliated Researcher Timothy Cole and OAC Project Coordinator and CIRSS staff member Jacob Jett will assist with workshop delivery.

    Workshop participants will learn about the data model's core features and advanced modules through tutorials, a showcase of existing implementations, Q&A sessions with community implementers and live demonstrations. Topics will include:

    ·         The Open Annotation Data Model

    ·         The W3C Open Annotation Community Group

    ·         Existing implementations

    ·         Developer tools & resources

    Rollout times and places:

    There is no registration fee but RSVP (online) is required. RSVP for a rollout near you using one of the links above or by visiting: http://www.openannotation.org/RolloutInfo.html

    ·         U.S. West Coast Rollout – 09 April 2013 at Stanford University (RSVP - https://www.eventville.com/Catalog/EventRegistration1.asp?Eventid=1010270)

    ·         U.S. East Coast Rollout – 06 May 2013 at the University of Maryland (RSVP - https://www.eventville.com/Catalog/EventRegistration1.asp?Eventid=1010271

    ·         U.K. Rollout – 24 June 2013 at the University of Manchester (RSVP - https://www.eventville.com/Catalog/EventRegistration1.asp?Eventid=1010272

     

     

     

  • 2/20/2013
    The Digital Preservation Coalition (DPC) and Digital Curation Centre (DCC) are delighted to announce a new issue of our joint newsletter ‘What’s New’.

    Read more

  • 2/19/2013

    Some of you have seeen already that DataCite is offering a statistics page for some time now: http://stats.datacite.org.
    This page provides you with detailed information on how many DOI names were registered by which DataCite member or data center, and also offering information about "Resolutions per month".

    This nice feature allows everyone to see how often DataCite DOI names are actually resolved, how often people want to see what is behind a DOI name. In January, we had almost half a million resolutions which is a nice number of usage. We also had even more failed DOI resolutions, which calls for a bit of explanation:
    Every not working resolution is counted as a failure, so if someone mis-types a DOI name in a handle server it is an error, if someone made a wrong hyperlink to a DOI name - by including for example a ")" at the end of the DOI name - it is an error. Most of the other failures actually come from attempts to resolve DOI names before they were registered.

    But going back to the successfull resolutions we can now present you the
    11 (*) most popular datasets of January 2013, registered by DataCite:

    So our 11 most popular datasets in January are: 1 poster, 7 data records and 3 textual records.
    This of course is fine as the scope of DataCite is to make scientific information citable that is not classical scholarly output.
    It also gives a rough idea of the heterogenity of content and of disciplines that DataCite is covering.

    (*) ERRATA: In an earlier version of this article, we had only the top 10 datasets, but the actual number 3 from Dryad slipped through our fingers.

  • 2/6/2013Thomas Padilla writer Thomas Padilla by Thomas Padilla published by Thomas Padilla

    CIRSS doctoral students, faculty, and associated faculty will be well represented at iConference 2013 "Scholarship in Action: Data, Innovation, and Wisdom". Faculty will lead workshops on varied topics such as workforce gaps in data science and data analytics and practical applications of computational scientometrics. CIRSS doctoral students will present on a wide array of topics, from the relationship between a unified ontology and integrated digital preservation systems to the exploration of macroeconomic indicators of data use.

    iConference 2013 is hosted by the University of North Texas and will take place in Fort Worth, Texas. An iConference is an annual gathering of scholars and researchers concerned with critical information issues in contemporary society. iConference participants advance the boundaries of information studies, explore core concepts and ideas, and create new technological and conceptual configurations situated in interdisciplinary discourses.

    Papers
    "Towards a Data Literate Citizenry"     
    Michael B. Twidale, Catherine Blake, Jon Gant, (University of Illinois at Urbana-Champaign, USA)

    Workshops
    "Filling the Workforce Gap in Data Science and Data Analytics"
    http://hdl.handle.net/2142/42501
    Catherine Blake (University of Illinois at Urbana-Champaign, USA), Jeff Stanton (Syracuse University, USA), Ray R. Larson (University of California, Berkeley, USA)

    "Computational Scientometrics: Theory and Applications"
    Cornelia Caragea (University of North Texas, USA), C. Lee Giles (Pennsylvania State University, USA), Lior Rokach (Ben-Gurion University of the Negev, Israel), Vetle I. Torvik (University of Illinois at Urbana-Champaign, USA)

    Presentations
    "What do we mean by preserving Information? Towards a unified ontology of digital resources to support the development of integrated digital preservation systems"
    Simone Sacchi (University of Illinois at Urbana-Champaign, USA)

    "Design Jams in iSchools: Approaches, Challenges and Examples"
    http://hdl.handle.net/2142/42520
    Michael Twidale, Ingbert Schmidt, Jeff Ginger, Peter Organisciak, Christopher Lueg (University of Illinois at Urbana-Champaign, USA)

    "Tech Learning Potluck"
    Michael Twidale, Sally Jo Cunningham, James Howison, Kate Williams, Peter Organisiak, Jeff Ginger, Brittany Smith, Ingbert Schmidt (University of Illinois at Urbana-Champaign, USA)

    Posters
    "Integrating Data Curation Concepts throughout the Project Lifecycle: A WILIS Case Study"
    Cheryl Annette Thompson (University of Illinois at Urbana-Champaign, USA); Joanne Gard Marshall, Amber L. Wells (University of North Carolina at Chapel Hill, USA); Jennifer Craft Morgan (Georgia State University, USA); Susan Rathbun-Grubb (University of South Carolina, USA)

    "How Information Science Professionals Add Value in a Scientific Research Center"
    Christopher Eaker, Erica Johns, Kayla Siddell (University of Tennessee, Knoxville, USA); Andrea Thomer (University of Illinois, Urbana-Champaign, USA)

    "Dispatches, Digests and Doodles: Exploring the Significant Properties of Field Notebooks"
    Andrea Thomer, Nicholas Weber (University of Illinois at Urbana-Champaign, USA)

    "Identifying Claims In Social Science Literature"
    Shameem Ahmed, Catherine Blake, Kate Williams, Noah Lenstra, Qiyuan Liu (University of Illinois at Urbana-Champaign, USA)

    "Lead, Lag or Get Out of the Index: Exploring Macro-economic Indicators of Data Use"
    Nicholas Matthew Weber, Andrea K. Thomer (University of Illinois, Urbana-Champaign, USA)

  • 1/29/2013
    The issue of disciplinary metadata standards - what they are, who's using them, how to use them - has been gaining attention in the RDM community. To support this, the DCC has created a Disciplinary Metadata page for those who need help figuring out what standards might address their own needs.

    Read more

  • 1/22/2013


    Dear DataONE Community
    The winter 2012/2013 issue of DataONENews is now available.
    Features include:

    • PI report
    • An invitation to participate in a DUG survey
    • A feature on our Education Modules
    • CyberInfrastructure and ITK Updates
    • Focus on activities of the Provenance Working Group
    • Member Node description: KNB
    • Outreach Update

    The newsletter can be downloaded here.

  • 1/14/2013

    DataONE, the California Digital Library and partners were awarded a $590,000 grant from the Alfred P. Sloan Foundation to fund further development of the popular Data Management Planning Tool (DMPTool) in 2013. The bulk of the grant will go to the UC Curation Center (UC3) at the CDL to fund improvements to the DMPTool including expanded functionality, training modules, documentation and the creation of an open-source community to sustain the DMPTool in the future.

  • 1/11/2013
    Data management planning is a topic of increasing importance to researchers, their host institutions and research funders alike. Increasing numbers of funders require plans as part of grant proposals; some are already assessing the quality of these plans as part of bid assessment; and some require further planning to be carried out for successful proposals and are discussing ways to monitor the execution of plans and compliance with funder requirements.

    Read more

  • 1/10/2013
    IDCC13 is now fully booked and we are looking forward to welcoming you all to the M��venpick Hotel in Amsterdam.      

    Read more

  • 12/20/2012
    The DCC is pleased to release a case study on the improvement of Research Data Management at Monash University. We're very grateful to Sam Searle, Anthony Beitz, Wilna Macmillan and Andrew Harrison for their input to this.

    Read more

  • 12/19/2012
    I'm very pleased to announce that we're able to make one final extension to the early bird booking rates for IDCC13 (the International Digital Curation Conference), and one final extension to the booking deadline. Both are now extended until January 2nd 2013.

    Read more

  • 12/18/2012

    The Data Curation Education in Research Centers program (DCERC) is a model for data curation education, coupling high-quality curriculum with field experiences in research and data centers. This unique program educates library and information science students at the master's and doctoral-levels, and builds on the strengths of the University of Illinois at Urbana-Champaign Graduate School of Library and Information Science and the University of Tennessee at Knoxville School of Information Science. The two iSchools have partnered with the National Center for Atmospheric Research, a world-renowned data center, to make it possible for students to learn state-of-the-art data management and curation practices. DCERC is funded by a grant from the Institute for Museum and Library Services as part of the Laura Bush 21st Century Librarian Program.

    A video that gives an overview of DCERC is available here: http://www.youtube.com/watch?v=mbX5bvgT1ME

  • 12/17/2012

    From December 10th to 11th DataCite held its 4th General Assembly in Cologne, Germany. At this occasion two new members were elected: DataCite welcomes the National Research Council of Thailand (NRCT) as its 17th full member and the Bejiing Genomic Institute (BGI) as its 5th affiliated member.
    The picture shows the Thai delegation: Soottiporn Chittmittrapap, Petchara Sungkhaworn and Namtip Wipawin in Colgne with Adam Farquhar, President of DataCite and Jan Brase, Managing Agent of DataCite.

    During the General Assembly furthermore 3 DataCite members were awarded with DataCite awards for the highest numbers of DOI registrations in 2012: The DataCite award in bronze goes to the German National Library of Science and Technology (TIB) for over 50,000 DOI names registered in 2012. The DataCite award in silver goes to the Library of the ETH Zurich for over 150,000 DOI names in 2012, and the DataCite award in gold finally goes to the California Digital Library (CDL) for the registration of over 250,000 DOI names in 2012. T

    Finally the General Assembly decided to hold next years summer meeting in combination with the JCDL 2013 in July in Indianapolis.

  • 12/10/2012

    The December 2012 issue of International Innovation includes a focus piece on DataONE. The article can be downloaded here however I recommend taking a look at the full issue online as it also includes covers some related EU projects including LifeWatch, Creative-B and COOPEUS. The issue can be viewed online here.

  • 12/10/2012
    Storage and the potential role of cloud and shared services are issues that arise regularly in the DCC's work with UK universities. We hope, therefore, that a forthcoming event we're planning in collaboration with colleagues in JISC and the JANET brokerage on storage service requirements will prove to be of interest to many.  

    Read more

  • 12/7/2012
    The International Journal of Digital Curation, Volume 7, Issue 2 has been published.

    Read more

  • 12/6/2012
    "Infrastructure, Intelligence, Innovation: driving the Data Science agenda" .......So what is a Data Scientist? The IDCC13 Programme will address this question in a Symposium to be held on Tuesday 15 January.  

    Read more

  • 12/4/2012Thomas Padilla writer Thomas Padilla by Thomas Padilla published by Thomas Padilla

    Dr. Karen Wickett will present "Representing Identity and Equivalence for Scientific Data" at the American Geophysical Union (AGU) Fall Meeting. Taking place December 3-7 2012 in San Francisco, California, the AGU Fall Meeting is the largest worldwide conference in the geophysical sciences, attracting more than 20,000 Earth and space scientists, educators, students, and other leaders.

    Wickett will address the issues of equivalence and identify in the representation of scientific data with two conceptual models developed out of the Data Concepts Group at the Center for Informatics Research in Science and Scholarship (CIRSS), University of Illinois at Urbana Champaign. The presentation is based on a paper coauthored with CIRSS Doctoral Student Simone Sacchi, and CIRSS affiliated faculty members David Dubin and Allen Reneaer.

    The American Geophysical Union (AGU), a Washington, DC-based international non-profit scientific association with more than 60,000 members, works on a broad spectrum of scientific topics that span all of the Earth and space sciences. Established in 1919 as a committee within the National Research Council of the National Academy of Sciences, AGU was independently incorporated in 1972. Since its founding, the organization has remained dedicated to advancing Earth and space sciences.

     

     

  • 11/30/2012

    Attending the AGU meeting next week? Interested in learning more about DataONE?
    Here is a run down of DataONE related activities by day.

    All Week:
    DataONE will have a section at the NEON booth (#108) in the exhibit hall.
    Learn more about DataONE and about NEON!

  • 11/28/2012Thomas Padilla writer Thomas Padilla by Thomas Padilla published by Thomas Padilla

    CIRSS Faculty, PhD students, and staff will be making a strong showing at the 8th International Digital Curation Conference (IDCC). Taking place January 14-16, 2013, IDCC brings together an international group of those who create and manage data and information, those who use it and those who research and teach about curation processes. Three students who participated in the Data Curation Education in Research Centers (DCERC) program - University of Tennessee School of Information Sciences students Christopher Eaker, Kayla Siddell, and Erica Johns - will also present posters at the conference.

    Find below a full list of CIRSS participation at IDCC:

    "The Product and System Specificities of Measuring Impact: Indicators of Use in Research Data Archives"
    Nicholas Matthew Weber (1); Andrea K. Thomer (1); Matthew S. Mayernik (2); Robert E. Dattore (3);Zaihua Ji  (3); Steven Worley (3) Organizations: 1: University of Illinois - Urbana Champaign, Center for Informatics Research in Science and Scholarship.; 2: National Center for Atmospheric Research- Library; 3: National Center for Atmospheric Research - CISL / RDA

    "Model Development for Scientific Data Curation Education"
    Mary Marlino (1); Karon Kelly (1); Matthew S. Mayernik (1); Suzie Allard (2); Carol Tenopir (2); Carole L. Palmer (3); & Virgil E. Varvel Jr. (3) Organizations: 1: National Center for Atmospheric Research; 2: University of Tennessee at Knoxville; 3: University of Illinois at Urbana-Champaign, Center for Informatics Research in Science and Scholarship

    "The tail and the telling: Distribution of NSF funding 2008-2011"
    Weber, Nicholas Matthew; Organization: University of Illinois - Urbana Champaign, United States of America

    "Describing the quality of research datasets across disciplines: a comparative study"
    Chao, Tiffany Organization(s): University of Illinois, United States of America

    "Levels of Services and Curation for High-Functioning Data"
    Choudhury, G Sayeed (2); Palmer, Carole (1); Baker, Karen S (1); DiLauro, Timothy (2)  Organization(s): 1: University of Illinois Urbana-Champaign, United States of America; 2: Johns Hopkins University

    "Where Have All the Scientific Data Gone: LIS Perspective on the Data At Risk Predicament"
    Thompson, Cheryl Annette; Robertson, William Davenport; Greenberg, Jane; Organization: University of North Carolina, United States of America

  • 11/28/2012Thomas Padilla writer Thomas Padilla by Thomas Padilla published by Thomas Padilla

    Nic Weber was recently profiled for his experience working at the National Center for Atmospheric Research (NCAR) as part of the Advanced Studies Fellowships Program.  During his time at NCAR, Weber studied how open-source climate models were developed among teams of software engineers and conducted an infometrics study of data re-use. Weber is the first information science student to earn a spot in the Advanced Studies Fellowship Program. 

    Read more here: http://www.lis.illinois.edu/articles/2012/11/weber-works-climate-modeling-ncar-scientists

  • 11/28/2012
    The DCC is recruiting - we're looking for a developer to join the team in Edinburgh.

    Read more

  • 11/19/2012
    • Keen to hear about cutting edge research from BBSRC’s Innovator of the year?
    • Interested in integrating your research information and research data systems?
    • Want to learn about RDM in practice from those at the coalface?
    Well then get yourself along to the final DCC roadshow in Dundee on 5-6 December!

    Read more

  • 11/15/2012
    Only just over 8 weeks to go until IDCC13 in Amsterdam! Don’t forget to register soon to take advance of the current early bird registration fees and to book for the workshops of your choice as places may be limited.

    Read more

  • 11/7/2012Thomas Padilla writer Thomas Padilla by Thomas Padilla published by Thomas Padilla

    Jacob Jett recently delivered a presentation as part of the "Using Open Annotation" panel at Digital Library Federation(DLF) Forum 2012.

    This panel examined scholarly digital annotation models, tools and services, focusing on recent work by the Open Annotation Collaboration and the W3C Open Annotation Community Group to improve the interoperability of annotation applications and to make annotations themselves more conformant with Linked Open Data principles.

    Panel members also provided an overview of the Open Annotation data model and described three exemplars of interoperable digital annotation applications built with funding from the Andrew W. Mellon Foundation: Annotation framework for Fedora, Automated annotation of biomedical text, and Digital Emblematica.

    For more information on the panel presentation visit here:
    http://www.diglib.org/forums/2012forum/using-open-annotation/

  • 11/7/2012Thomas Padilla writer Thomas Padilla by Thomas Padilla published by Thomas Padilla

    Katrina Fenlon recently facilitated a discussion on applications of topic modeling to humanities research questions. Held November 3 at the Maryland Institute for Technology in the Humanities, and funded by the National Endowment for the Humanities, the Topic Modeling for Humanities Research workshop provided an "opportunity for cross-fertilization, information exchange, and collaboration between and among humanities scholars and researchers in natural language processing on the subject of topic modeling applications and methods".

    Further information about the workshop, including a bibliography of relevant resources and archived video/audio, is available at http://mith.umd.edu/topicmodeling

  • 11/7/2012Thomas Padilla writer Thomas Padilla by Thomas Padilla published by Thomas Padilla

    The IMLS Digital Collections & Content (DCC) team has received a $50,000 sub-award from the Digital Public Library of America’s (DPLA) Secretariat to further refine the team’s DPLA Beta Sprint prototype. The Beta Sprint prototype was originally developed during Summer 2011 as part of the DPLA Beta Sprint. The prototype was selected as one of six finalist projects showcased during the DPLA’s Fall 2011 Plenary Meeting in Washington, D.C.

    Bringing together resources from over 1000 cultural heritage collections across the U.S., the prototype builds on the DCC team’s knowledge and experiences aggregating metadata records from varied institutions ranging from libraries and archives to museums and historical societies. The new sub-award will allow the DCC team to make further refinements to the prototype’s information retrieval algorithms and implement additional layers of interactive functionality that allow users to interact more directly and dynamically with the prototype’s data.

  • 11/1/2012


    The Fall 2012 issue of DataONENews, the recently redesigned newsletter, is out.

    Features include:

    • Message from the DataONE Project Director
    • Updates from the project
    • focus pieces on some of our Working Groups and Member Nodes
    • Outreach Update
    • and more!

    Click Here to download.

  • 10/31/2012

    DataCite���s Business Models Principles document is now available for download. Intended for DataCite members and their clients, the document looks at DataCite member and client responsibilities, as well as best practices.

    DataCite members may consult the Principles to learn about their responsibilities regarding contracts, costs and fee schedules, and prefixes. DataCite clients can use the document to understand their responsibility to ensure data persistence and to provide metadata. As well, they can learn about best practices such as landing pages and DOI syntax.

    This document will be reviewed at least annually and may be updated more frequently, as necessary. Have a look!

  • 10/30/2012
    The Digital Preservation Coalition (DPC) and Digital Curation Centre (DCC) are delighted to announce a new issue of our joint newsletter ‘What’s New’.

    Read more

  • 10/25/2012Thomas Padilla writer Thomas Padilla by Thomas Padilla published by Thomas Padilla

    CIRSS Associate Director Catherine Blake, along with Jeffery Stanton of Syracuse University, and Joshua Blumenstock of the University of Washington have organized a workshop on Sociotechnical Data Analytics to be delivered at iConference 2013.

    Sociotechnical Data Analytics (SODA) emphasizes social (economic, ethical, policy, and political) and technical (databases, social networking, data mining, and text mining) aspects of data analytics that are required to fully realize the potential of the Fourth Paradigm of science.

    The workshop will provide a forum for iSchool faculty who are developing programs in data analytics, eScience, eResearch, big data, and cyberinfrastructure to develop best practices with respect to preparing students to fill the workforce needed for managers and analysts to analyze big data and make decisions based on their findings.

    The iConference is an annual gathering of scholars and researchers concerned with critical information issues in contemporary society. iConference participants advance the boundaries of information studies, explore core concepts and ideas, and create new technological and conceptual configurations—all situated in interdisciplinary discourses.

  • 10/25/2012
    The IDCC13 Workshops Programme is now available. The workshops will be held on Monday 14 January and Thursday 17 January 2013 at the M��venpick Hotel, in Amsterdam City Centre. The majority of workshops will be free of charge and include refreshments and lunch for all attendees. Registration is open.

    Read more

  • 10/22/2012Thomas Padilla writer Thomas Padilla by Thomas Padilla published by Thomas Padilla

    CIRSS Faculty, Affiliated Faculty, PhD Students, and Staff will be making yet another strong showing at ASIS&T's annual meeting. Taking place October 26-30, 2012 the ASIS&T Annual Meeting is a primary venue for disseminating research centered on advances in the information sciences and related applications of information technology.

    Find below a list of CIRSS presentations, papers, posters, and workshops:

    Unreliable and Uncertain Annotators: Evaluating Rater Quality and Rating Difficulty in Online Annotation Activities
    Organisciak Peter, Efron Miles, Fenlon Katrina and Megan Senseney

    http://www.asis.org/asist2012/abstracts/166.html

    Identifying Content and Levels of Representation in Scientific Data
    Karen Wickett, Simone Sacchi, David Dubin and Allen Renear

    http://www.asis.org/asist2012/abstracts/199.html

    Value and Context in Data Use: Domain Analysis Revisited
    Nicholas Weber, Karen Baker, Andrea Thomer, Tiffany Chao and Carole Palmer

    http://www.asis.org/asist2012/abstracts/168.html

    Tooling the Aggregator's Workbench: Metadata Visualization Through Statistical Text Analysis
    Katrina Fenlon, Miles Efron and Peter Organisciak

    http://www.asis.org/asist2012/abstracts/161.html

    Enhancing Cultural Heritage Collections by Supporting and Analyzing Participation in Flickr
    Jacob Jett, Megan Senseney and Carole Palmer

    http://www.asis.org/asist2012/abstracts/287.html

    Combined Methods, Thick Descriptions: Languages of Collaboration on Github
    Nicholas Weber

    http://www.asis.org/asist2012/abstracts/347.html

    The Data-at-Risk Initiative:  Analyzing the Current State of Endangered Scientific Data
    Angela P. Murillo, Cheryl Thompson, Nico Carver, W. Davenport Robertson, Jane Greenberg and William Anderson
    http://www.asis.org/asist2012/abstracts/346.html

    Complications in climate data variable naming

    Nic Weber, Andrea Thomer, and Gary Strand

     

     

  • 10/22/2012

    GSLIS student and CIRSS Graduate Assistant, Thomas Padilla recently presented, "Preservationistas: Online Communities, Activist Preservation, and Born-Digital Archives" at the Archives and Activism Symposium in New York City. Held October 12, 2012 at The New School, the symposium brought together leaders in the archival activism space to discuss recent projects and future directions for archival activism.

    This symposium was dedicated to the memory of archivist and historian Michael Nash (1946-2012).

  • 10/18/2012

    DataCite offers various services supporting the DOI registration and discovery of research data. These services include the Metadata Store, a statistics portal, the OAI-PMH Provider and many more. Please have a look at our Services website for more information!

  • 10/11/2012
    Registration for the Dundee roadshow is now open! The event takes place on 5-6 December in the Library and Learning Centre, University of Dundee. Further information, including a draft programme, is available on the event page. Register here to join us in Dundee!

    Read more

  • 10/10/2012

    DataCite will host open hours for data centres on Tuesday, 16th October 2012. 5pm Berlin time. Please look here for your local time.

    We want to gather feedback from the end users, listen to suggests, answer potential questions etc. In general this meeting is open, informal and a friendly way to build relationships with our end users. The communication language will be English.

    Please do login at https://webconf.vc.dfn.de/dc-open on October 16th. This is password free and in general open.

  • 10/9/2012vv writer vv by vv published by vv

    As part of the National Center for Professional and Research Ethics - Ethics Awareness Week, Virgil Varvel of the Center for Informatics Research in Science and Scholarship will be participating as part of a panel being led by Nicholas Burbulas on When IRBs Do Not Agree: On Campus, Across Universities, Internationally.

    When IRBs Do Not Agree: On Campus, Across Universities, Internationally
    Time: 1:00 PM
    Place: Illini Union 209
    Institutional Review Board (IRB) approval is a requirement for virtually all research involving human subjects. But there are challenges confronting IRB review in three types of collaborative settings: interdisciplinary research within an institution, research that spans different institutions with multiple IRBs, and international research that involves other countries with different research traditions. How can researchers navigate these tricky waters?

     

    Panelists
    Nicholas Burbules is a Professor of Educational Policy Studies and is an affiliate of the Unit for Criticism and Interpretative Theory. He is a nationally preeminent scholar in the field of philosophy of education and is internationally recognized for his work focusing on the use of information technology in education. His current research includes project related to on ethical and policy issues concerning new technologies in education; virtual reality; collaboration; and dialogue and "third spaces."

    Ms. Anita Balgopal is the Director of the Institutional Review Board at the University of Illinois. Anita has served as the IRB Director at the University of Chicago's School of Social Service Administration and Chapin Hall Center for Children. Prior to this, she was the IRB Manager at Northwestern University's Office for the Protection of Research Subjects.

    Dr. Virgil Varvel is a Research Analyst for the Center for Informatics Research in Science and Scholarship (CIRSS). His research has been primarily on various aspects of distance education. His current work explores pedagogical assumptions of socially organized versus independent study instructional design in distance education.

    John Caughlin is a Professor and Associate Head in the Department of Communication at the University of Illinois at Urbana-Champaign. Research and teaching interests: family communication, privacy and secrets, disclosure of HIV/AIDS. Professor Caughlin recently received the Bernard J. Brommel Award for Outstanding Scholarship in Family Communication from the National Communication Association.


    http://ethics.grainger.illinois.edu/IRBpanel.asp

     

  • 10/3/2012

    The Institute for Museum and Library Services (IMLS) has awarded $499,919 for the project, Site-Based Data Curation for Small Science, led by CIRSS Director Carole Palmer, with co-Principal Investigators Bruce Fouke, Professor in Geology, Microbiology, and the Institute for Genomic Biology (IGB) at the University of Illinois at Urbana-Champaign; Sayeed Choudhury, Hodson Director of the Digital Research and Curation Center and Associate Dean of Library Digital Programs at Johns Hopkins University; and Ann Rodman, Director of GIS Operations at Yellowstone National Park.

     Bringing together experts in data curation, data repositories, geobiology, and research site management, the Site-Based Data Curation (SBDC) project will investigate and test curation policies and procedures to advance the transfer of long-tail digital data collected at Yellowstone National Park (YNP) to the Data Conservancy for preservation and access, and to better coordinate the management of data resources produced at the many scientifically significant sites at YNP. The framework will result in a general model of professional curation processes readily extendible to other national parks and other important research sites, especially cradles of biodiversity such as coral reefs and deep crustal biosphere locations.

    The new data curation approaches will be integrated into the curriculum of the Specialization in Data Curation at GSLIS and undergraduate and graduate geobiology courses taught at Illinois, with educational outreach extended to Yellowstone. The education activities will advance data curation workforce expertise in handling complex, cross-disciplinary data and prepare scientific communities to contribute to and take advantage of diverse collections of curated data.

    The SBDC framework is an important step forward in evolving the professional best practices and institutional collaborations needed to build large-scale, interoperable data collections that include high-functioning long-tail data and are responsive to the pressing data needs of practicing research communities and resource management at sites of data production.

     

  • 10/2/2012

    October 2nd 2012 - The University of California���s Digital Library (CDL) and its partners today launched DataUp, a free data management tool.

    Researchers struggling to meet new data management requirements from funders, journals and their own institutions can now use the DataUp web application and a Microsoft Excel add-in to document and archive their tabular data.

    ���DataUp will change the way scientists do their work, making it easy for them to manage and preserve their spreadsheet data for future use,��� said Bill Michener, principal investigator for the DataONE project.

  • 9/27/2012
    The Call for posters and demonstrations will close on 29 October 2012  

    Read more

  • 9/25/2012Thomas Padilla writer Thomas Padilla by Thomas Padilla published by Thomas Padilla

    CIRSS Associate Director Catherine Blake recently  presented "Using Secondary Information to inform Public Policy" at the Oxford Internet Institute's "Internet, Politics, Policy 2012" conference. 

     Taking place September 20-21, the conference explored the new research frontiers opened up by Big Data as well as its limitations. The conference also aimed to serve as a forum to encourage discussion across disciplinary boundaries on how to exploit Big Data to inform policy debates and advance social science research.

     More information about Catherine Blake's presentation is available here:

    http://microsites.oii.ox.ac.uk/ipp2012/catherine-blake-using-secondary-information-inform-public-policy

     

  • 9/12/2012
    Registration for IDCC13 is now open and the draft programme is available.  

    Read more

  • 9/7/2012
    JISC have released a white paper on Curation in the Cloud, which looks at the potential and practicalities of using cloud-based solutions for the curation and long-term preservation of digital materials, focusing particularly on data that originates from research or that supports research processes.

    Read more

  • 9/6/2012
    The Digital Preservation Coalition (DPC) and Digital Curation Centre (DCC) are delighted to announce a new issue of our joint newsletter ‘What’s New’.

    Read more

  • 9/5/2012Thomas Padilla writer Thomas Padilla by Thomas Padilla published by Thomas Padilla

    Carole Palmer, CIRSS Director and Professor of Library and Information Science, will present “The Analytic Potential of Long-Tail Data: Sharable Data and Re-use Value” at Wolfram Data Summit 2012.  

    Taking place September 6 – 7  in Washington, DC, Wolfram Data Summit 2012 is an invitation only event that offers leaders of the world's data repositories an opportunity to meet, to share insights into their work, and to discuss challenges and opportunities facing the data community. The third annual summit will place an emphasis on content, rather than infrastructure, in areas such as: data from social media, location-based data, freeing health care data, data narratives, news as data, natural language processing, government and election data, corporate data silos, culturomics, bibliometric data, data conservation, and semantic data.

    More information is available here:

    http://www.wolframdatasummit.org/2012/

     

  • 8/27/2012Thomas Padilla writer Thomas Padilla by Thomas Padilla published by Thomas Padilla

    Library Journal recently profiled Lynn Yarmey (MS ’11) for the “Librarians Abroad” series, which highlights professionals working outside traditional library and archives settings.

    While at GSLIS Yarmey completed the Specialization in Data Curation and through the CIRSS DCEP Program had the opportunity to participate in an internship at the National Snow and Ice Data Center (NSIDC)  in Boulder, Colorado.

    Yarmey is now the Senior Data Curator at NSIDC for the NSF-funded Advanced Cooperative Arctic Data and Information Service (ACADIS) project. 

    Read below for details about Yarmey’s work and thoughts on her GSLIS education:


    The Data Curation Specialization at the Univ. of Illinois Urbana Champaign Graduate School of Library and Information Science (GSLIS) was a great fit for me. The program offered strong theoretical underpinnings and brought my practical experience into a much broader context. The LEEP distance option meant I could keep working in the labs while learning about the theory, a perfect balance for my data work.

    I love that I get to do a little bit of everything! I am involved with two Agile development teams, user interface and design work, refining dataflows, scoping and implementing metadata profile updates, data management education, defining new projects and products in support of Arctic scientists, and tying all of these together in presentations, papers, and proposals . . . My role is almost as diverse as the data. Every day I am bridging between so many different groups of fantastic people to promote understanding across the project horizontals, and then vertically mapping the detailed daily operations with the project vision. I am a jack-of-all-trades type of person much more than a deep specialist; my work allows me to use and celebrate that quality.

  • 8/23/2012Thomas Padilla writer Thomas Padilla by Thomas Padilla published by Thomas Padilla

    GSLIS Interim Dean and affiliated CIRSS faculty member Allen Renear, will deliver the opening keynote at the National Information Standards Organization (NISO) Forum: Tracking it Back to the Source: Managing and Citing Research Data.

    Taking place September 24, 2012 in Denver, Colorado, the NISO forum will address challenges posed by the exponential rise of data creation across nearly all scholarly disciplines by focusing on several new initiatives to improve community practice on data citation and data discovery.

    More information is available here:

    www.niso.org/news/events/2012/tracking_it_back_to_the_source/

     

  • 8/23/2012Virgil Varvel writer Virgil Varvel by Virgil Varvel published by Virgil Varvel

    Catherine Blake will present at the Center for the Analysis of Cellular Mechanisms and Systems Biology, Montana State University, Bozeman Montana.

    Her presentation, Claim Jumping: Bridging disciplinary boundaries using the Claim Framework, will be a part of the three day workshop, Making Sense of Biological Systems: Using Knowledge Mining to Improve and Validate Models of Living Systems from August 23-25. 

    More information is available: http://www.chemistry.montana.edu/cobre/workshop/Program.html

  • 8/16/2012
    The DCC are pleased to announce that the Paper Submission date for IDCC13 has been extended by 10 days. The Call will now close at 12 noon BST on Thursday 30 August 2012.   #next_pages_container { width: 5px; hight: 5px; position: absolute; top: -100px; left: -100px; z-index: 2147483647 !important; }  

    Read more

  • 8/15/2012
    Aligning National Approaches to Digital Preservation is a new publication from Educopia, which establishes a set of starting points for building a greater alignment across digital preservation, and highlights the need for strategic international collaborations.

    Read more

  • 8/10/2012
    Sally Rumsey, Digital Collections Development Manager at the Bodleian Libraries, University of Oxford, reviews the book Managing Research Data, edited by DCC's Graham Pryor.

    Read more

  • 7/27/2012

    CIRSS is one of three collaborators on a recently announced NEH award, led by the Maryland Institute for Technology in the Humanties (MITH), to develop a series of data curation workshops for humanities scholars, librarians and archivists.  The NEH award announcement and further details on the project are reproduced below from MITH PI Trevor Munoz's 26 July 2012 blog post:

    MITH is pleased to announce an award from the National Endowment for the Humanities 2012 Institutes for Advanced Topics in the Digital Humanities competition for a series of workshops on data curation for humanities scholars, librarians, and archivists interested in sustaining meaningful access to humanities research materials.

    The Digital Humanities Data Curation Institutes project, directed by Trevor Munoz, Associate Director of MITH and Assistant Dean for Digital Humanities Research, University Libraries, will facilitate a multi-institutional collaboration between MITH and the University Libraries at the University of Maryland, the Women Writers Project (WWP) at Brown University, and the Center for Informatics Research in Science and Scholarship (CIRSS) at the Graduate School of Library and Information Science (GSLIS), at the University of Illinois, Urbana-Champaign to provide three workshops during 2013.

    The practice of cutting-edge humanities research increasingly involves acquisition, synthesis, and management of data in digital form. The theoretical knowledge and practical skills of information science, librarianship, and archival science represent a vital component of the skill set that will be required to succeed in the rapidly transforming landscape of the academy and the wider society.

    Digital Humanities Data Curation institutes will serve as opportunities for participants with all levels of expertise from beginners to the most advancedto receive guidance in understanding the role of data curation in enriching humanities research projects. By the conclusion of each institute, participants will be adept at formulating solutions for existing challenges and will be able to document their data curation strategies in the form of data curation plans and strategic risk assessments, key elements of innovative digital scholarship.

    A core resource for the Institute will be the Digital Humanities Curation Guide (DH Curation: http://guide.dhcuration.org) developed at GSLIS. The Guide allows instructors and participants to share scholarly knowledge about literature, tools, projects, and standards relevant to curating humanities data. A forum through which knowledge developed at the institute can be shared with the broader research community, the Guide will allow for the aggregation of resources and responses from across the Institutes three events. Julia Flanders (WWP) and Dorothea Salo (Faculty Associate in the School of Library and Information Studies at the University of Wisconsin at Madison) will serve as co-instructors alongside Muoz for the three institute events and will contribute resources to the Guide.

    Applications to join this cohort of scholars focused on discipline-specific curation practices and skills will be announced in late fall 2012 with the Institute beginning in Spring 2013. For more information, please visit: http://mith.umd.edu/research/project/data-curation/. 

    [Reproduced from MITH PI Trevor Munoz's 26 July 2012 blog post.]

  • 7/16/2012
    The Digital Preservation Coalition (DPC) and Digital Curation Centre (DCC) are delighted to announce a new issue of our joint newsletter ‘What’s New’.

    Read more

  • 6/21/2012
    The Digital Preservation Coalition (DPC) and Digital Curation Centre (DCC) are delighted to announce a new issue of our joint newsletter ‘What’s New’.

    Read more

  • 6/21/2012

    CIRSS RA Ashley Clark will be presenting her paper entitled "Meta: Exploring the Provenance of XSL Transformations" at Balisage in early August. The paper is the product of her work with the Data Curation Education Program in the Humanities. The following is the article abstract.
    "When documents are transformed with XSLT, what methods can be used to understand and record those transformations? Though they aren't specifically meant for provenance capture, existing tools and informal practices can be used to manually piece together the provenance of XSLTs. However, a meta-stylesheet approach has the potential to generate provenance information by creating a copy of XSLT stylesheets with provenance-specific instructions. This method is currently being implemented, using the strategies and workflows detailed here. Even with the complications and limitations of the method, XSLT itself enables a surprising amount of provenance capture."
  • 6/21/2012

    The DH Curation Guide will be officially launched during the Digital Humanities 2012 conference, held July 16-22 at the University of Hamburg in Germany. A product of the Data Curation Education Program for the Humanities (DCEP-H), funded by IMLS, the DH Curation Guide is an online educational resource for students and professionals that offers expert-written articles about digital humanities data curation concepts. (http://guide.dhcuration.org/) Trevor Muñoz (UMD) and Robin Davis, both alums of CIRSS, and Julia Flanders (Brown) are the managing co-editors of the project, which features six articles written by contributing editors who are experts in their fields, with six more articles coming soon. The web site, currently available as a beta release, has seen a positive response from the data curation and digital humanities communities, with over 500 page views since May 8 and supportive feedback through Twitter (@dhcuration).

  • 6/19/2012

    CIRSS Associate Directory Cathy Blake has been awarded a Laura Bush 21st Century Librarian Program grant from The Institute of Museum and Library Services totaling $498,777 to create a specialization in Sociotechnical Data Analytics (SODA) within both the master’s and doctoral degrees.

    "One of the exciting aspects about the SODA education program is the dual emphasis on social and technical aspects of data analytics," Blake, who will serve as principal investigator on the project. In addition to the mathematical modeling that typifies data analytics, students who graduate from the GSLIS SODA program will also understand the social, ethical, and policy aspects of big data. "That combination will make students uniquely prepared to fill the growing workforce gap in people who can effectively manage and analyze big data—a gap that, according to The McKinsey Global Institute report on Big Data, will culminate in a shortfall of 1.5 million data-savvy managers and analysts by 2018," she said.

    The SODA research group was formed in 2010. The group, which includes faculty members Jana Diesner, J. Stephen Downie, Miles Efron, Brant Houston, Jerome McDonough, Vetle Torvik, and Michael Twidale, is part of the Center for Informatics Research in Science and Scholarship (CIRSS), where Blake serves as associate director. SODA research explores how to best design, develop, and evaluate new technologies in order to better understand the dynamic interplay between information, people, and technology. Group members conduct research in information retrieval, data and text mining, knowledge discovery, social computing, collaboration, and most recently network analysis.

    "I am thrilled that we will now be able to formalize an educational program that mirrors the outstanding faculty research," Blake said. "In addition to our faculty, we have some great partners who will enable us to better integrate real-world data sets into the classroom, and augment the classroom experience with a hands-on practicum and projects where students work side-by-side with scientists and business analysts."

    The new program will complement an existing Specialization in Data Curation led by Carole Palmer, GSLIS professor and director of CIRSS, as well as the Certificate of Advanced Study in Digital Libraries led by Jerome McDonough. "SODA is just one more piece in the evolving constellation of programs that give the next generation of information professionals the expertise they need to thrive in the information age," said Blake.

  • 6/11/2012

    "Centuries of Knowledge: Graduate School of Library and Information Science Data Curation Education Program" (DCEP), one of our most successful and long-running grant projects, concluded this December, and the final report is now available through IDEALS: http://hdl.handle.net/2142/30845.
     
    DCEP was funded by the Institute for Museum and Library Services (RE-05-05-0036) and was designed to increase educational and research capacity in data curation at GSLIS. In the first year of the project we developed the Data Curation Education Program, a specialization within our Master of Science degree program.  As of May 2012, 54 students have completed the specialization, and our alumni have gone on to secure positions in a variety of institutions, including research centers, academic libraries, government agencies, and corporate industry.
     
    In developing the curriculum for the specialization, we also created two new courses: Foundations of Data Curation, a survey course on the emerging field, and Digital Preservation. Along with Systems Analysis and Management, they constitute the core required courses in the specialization, and they are consistently enrolled at capacity each semester.
     
    In 2008, we conducted the first annual Summer Institute on Data Curation for practicing information professionals, facilitating the development of a community of practice across U.S. and Canadian academic and research organizations. Our outreach and service activities have led to a range of new partnerships that have resulted in student fieldwork opportunities and new collaborative research and education activities at CIRSS, resulting in four successful grant proposals.

  • 6/8/2012
    The DCC in collaboration with the wider Research Data Management community is organising a workshop on insitutional data repositories on July 9, in conjunction with Open Repositories 2012, to be held in Edinburgh.

    Read more

  • 6/8/2012
    A draft whitepaper of the Community Capability Model Framework for data-intensive research, developed by UKOLN and Microsoft Research, is available for comment.

    Read more

  • 6/7/2012
    Last year in April 2011, the Engineering & Physical Sciences Research Council (EPSRC) set out its research data management expectations for institutions in receipt of EPSRC grant funding, which included the development of an institutional ‘Roadmap’. Today, on 7 June 2012, the University of Bath has announced the publication of its Roadmap, which you can now view and download (PDF).

    Read more

  • 6/4/2012

    4 June 2012 - 19 February 2013

    Read more

  • 5/15/2012
    The Digital Preservation Coalition (DPC) and Digital Curation Centre (DCC) are delighted to announce a new issue of our joint newsletter ‘What’s New’.

    Read more

  • 5/14/2012
    The Digital Curation Centre is pleased to announce the launch of DMP Online v3.0 This new release marks a major progression in the software’s functionality. For the first time users can create data management plans incorporating multiple templates, so if your institution, your funder and your publisher all require data management plans, you can now create a single plan to satisfy them all.

    Read more

  • 5/2/2012
    The DPC, Richard Wright and Charles Beagrie Ltd have recently announced (30 April 2012) the release of the latest Technology Watch Report ‘Preserving Moving Pictures and Sound’, written by Richard Wright, formerly of the BBC.

    Read more

  • 4/30/2012
    In collaboration with the Research Information Network (RIN) and the Society of College, National and University Libraries (SCONUL), Vitae is pleased to announce the publication of The informed researcher booklet and an Information literacy lens on the Vitae Researcher Development Framework.

    Read more

  • 4/27/2012

    Ana Lucic will be defending her CAS project entitled "Characterizing Authorship Style: Contrasting Linguistic and Statistical Strategies."

    Abstract: Of the five categories of features used in authorship attribution studieslexical, character, syntactic, semantic, application-specific (Stamatatos, 2009, p. 540)it is the lexical and character features that have been tested most extensively. Syntactic, semantic, and application-specific features, also called high-level features, have been tested with far less consistency. The process of extraction of high-level features is more complex, usually depends on the availability of a parser, and generally is not as reliable as the extraction of surface-level features. High-level features, however, provide a more complex view of the text and thus have the potential to be a stronger marker of difference between different authorial styles than surface-level features. In this study, we explore the potential of syntactic dependencies which hold between two words in a sentence to separate writing styles in an authorship attribution study which uses a collection of movie reviews downloaded from the film database service imdb.com (Seroussi, 2010). Rather than focusing on syntactic dependencies on the level of the entire text of the review, we focus on personal names and on syntactic dependencies which occur immediately before and immediately after personal names. The references to personal names and the grammatical structures that govern these references thus become the key feature for this analysis. The exploratory principal component analysis conducted on this feature revealed its high variability, which speaks to the potential of diverse ways reviewers refer to people to be a strong marker of difference between authorial styles.
     
    The final manuscript is available from the front office.


    Date: Thursday, May 10
    Time: 2-4pm
    Location: Room 242
    Project Advisor: Dr. Catherine Blake
    Committee Members: Dr. David Dubin, Dr. John Unsworth


  • 4/11/2012

    GSLIS Professor and CIRSS Director Carole Palmer recently shared her thoughts in the University of Illinois feature, "A Minute With . . .," following the Obama administration's announcement of a $200 million research initiative in "big data" computing:

    • Informatics is about methods and strategies for using information in organizations, networks, cultures, and societies. Our job is to make advances that help people get access to and work with information to solve problems and make new discoveries.
    • The definition of data curation that we promote is the active and ongoing management of data through its life cycle of interest and usefulness to scholarship, science, and education.
    • Data are very valuable assets—the raw materials of research—with tremendous potential for re-use in new and innovative ways. But digital data are high risk—extremely fragile and with few standards of good practice.
    • We study how to collect and add value to data, to promote sharing and integration across institutions and fields of research, looking at both technical and social problems in making data a collective, shared resource.
    • The Data Conservancy (http://dataconservancy.org) is a large multi-institutional collaboration led by Johns Hopkins University. We are partners, contributing to research and education through our data curation initiatives at CIRSS.

    GSLIS has been at the forefront of data curation education since launching its specialization within the Master of Science degree in 2006, beginning with a focus on the sciences and expanding to include the humanities in 2008. Currently, more than 50 students enroll each year in the Foundations of Data Curation course, with many completing the GSLIS Specialization in Data Curation (http://www.lis.illinois.edu/academics/programs/ms/data_curation).
     
    The interview with Professor Palmer was conducted by Dusty Rhodes, news editor for the U of I News Bureau.  Read the full interview at http://illinois.edu/lb/article/72/62055.

  • 4/10/2012

    CIRSS researchers presented two posters at the third annual annual ASIS&T Research Data Access and Preservation (RDAP) Summit (http://rdap12.posterous.com), held 22-23 March 2012 in New Orleans, LA.  Topics explored at this year's summit included data management plans and policies; training of data management practitioners; discovery of research data; data curation service models; sustainability of data management; and data curation.

    The two posters report on CIRSS activities on the Data Conservancy project (http://dataconservancy.org), funded by NSF and led by partners at Johns Hopkins University.

    -------------

    What Dataset Descriptions Actually Describe: Using the Systematic Assertion Model to Connect Theory and Practice
    Karen Wickett, Andrea Thomer, Simone Sacchi, Karen S. Baker, David Dubin
    Available at: http://hdl.handle.net/2142/30470

    Scientific data is encoded and described with the aim of supporting retrieval, meaningful interpretation and reuse. Encoding standards for datasets like FGDC, DwC, EML typically include tagged metadata elements along with the encoded data, suggesting that, per the Dublin Core 1:1 principle, those elements apply to one and only one entity (a specimen, observation, dataset, etc.).  However, in practice vocabularies are often used to describe different dimensions of scientific data collection and communication processes. Discriminating these aspects offers a more precise account of how symbols and the propositions they express acquire the status of “data” and “data content,” respectively.

    In this poster we present an analysis of species occurrence records based on the Systematic Assertion Model (SAM) [DWS]. SAM is a framework for describing the encoding and representation of scientific data, bridging the gap between data preservation models and discipline-specific scientific ontologies. The model is intended to be general enough for any scientific domain, and not bound to any particular methodology or field of study. Since species occurrence records are a kind of data that is frequent re-used, migrated across systems and shared they are a good target for analysis.

    Sample data is reviewed in the context of SAM, and analyzed with respect to the provenance events, entities, and relationships governing our definitions of data and data content. The exercise serves to:

    1. highlight targets for data description (expression, content, assertion, justification).

    2. inform the discovery of anomalous or missing contextual/background information.

    3. frame a comparison of generic metadata standards (e.g. Dublin Core) with standards created specifically for scientific use (FGDC, DwC, EML).

    4. clarify competing criteria for the identification of data that is tied to the scientific assertions carried by a dataset, and not specific to the details of a format or encoding.

    -----------------

    Integrating Conceptual and Empirical Studies of Data to Guide Curatorial Processes
    Carole L. Palmer, Tiffany C. Chao, Nicholas M. Weber, Simone Sacchi, Karen M. Wickett, Allen H. Renear, Karen Baker, Andrea Thomer, & David Dubin

    Two research teams within the Data Conservancy (http://dataconservancy.org/) project are investigating different aspects of scientific data curation. Data Concepts is developing a conceptual model to foster shared understanding of identity conditions and representation levels for data sets. Data Practices is conducting qualitative studies of data production and use in the earth and life sciences, analyzing curation needs, cultures of sharing, and re-use potential across disciplines. This poster will illustrate the integration of results from three phases of research to develop a more comprehensive and practical analysis of fundamental aspects of data curation.

    •    Phase 1, Data Concepts team - Preliminary framework for definitions of “dataset” based on review of technical documentation and scientific literature, to support curation and integration of data across disciplines. Found four common features across definitions--grouping, content, relatedness, and purpose, elaborating each based on evidence from the literature.
    •    Phase 2, Data Practices team - Conceptual mapping of data characteristics, data practices, and curation activities, consisting of approximately 145 terms. Emphasizes relationships between data practices and curatorial activities for application to description and assessment of curation services.
    •    Phase 3, Data Practices team - Analytic potential concept developed as a theoretical approach to assessing the value of data beyond its original intended use. Extends Hjørland’s (1997) notion of “epistemological potential”, acknowledging the essential condition of preservation readiness and two key interrelated factors, potential user communities and fit for purpose.

    We will demonstrate how the Phase 1 framework has been tested and extended based on empirical data and analysis from Phases 2-3. In particular, we show how scientists’ practices and ideas about meaningful units of data adhere to and diverge from the framework’s conception of “grouping”. We also identify and discuss additional elements of “purpose” needed to inform the curatorial processes of selection and appraisal and set curation priorities for making data fit for long-term use.

    References

    Cragin, M.H., Palmer, C.L., & Chao, T.C. (2010). Relating data practices, types, and curation functions: An empirically derived framework. Poster presented at the Annual Meeting of the American Society for Information Science & Technology (ASIS&T), October 22-27, 2010, Pittsburgh, PA.

    Hjørland, B. (1997). Information Seeking and Subject Representation: An Activity-Theoretical Approach to Information Science. Westport, CT:  Greenwood.

    Palmer, C.L., Weber, N.M., & Cragin, M.H. (2011). The analytic potential of scientific data: Understanding re-use value. Proceedings of the Annual Meeting of the American Society for Information Science & Technology (ASIS&T), October 9-12, 2011, New Orleans, LA.

    Renear, A., Sacchi, S., & Wickett, K. (2010). Definitions of dataset in the scientific and technical literature. Proceedings of the Annual Meeting of the American Society for Information Science & Technology (ASIS&T), October 22-27, 2010, Pittsburgh, PA.

  • 4/6/2012
    The Digital Preservation Coalition (DPC) and Digital Curation Centre (DCC) are delighted to announce a new issue of our joint newsletter ‘What’s New’.

    Read more

  • 4/4/2012
    Some places remain for the DCC Northeast England Roadshow in Newcastle upon Tyne, 23-24 April. The event is being held at the Darwin Suite, 3rd Floor Bioscience Centre, Times Square, Newcastle The programme has an exciting range of speakers who will share examples of developing research data management services and infrastructure. Registrants can sign up for either or both days of this free event.

    Read more

  • 3/14/2012
    The Digital Preservation Coalition (DPC) and Digital Curation Centre (DCC) are delighted to announce a new issue of our joint newsletter ‘What’s New’.

    Read more

  • 3/13/2012
    The International Journal of Digital Curation, volume 7, issue 1 has been published.

    Read more

  • 3/12/2012
    Europeana project researchers Carlo Meghini and Antoine Isaac visited CIRSS from 6-9 March 2012 to share information and discuss the Europeana and IMLS DCC data models. They led sessions of the CIRSS eResearch Round Table and the CIRSS Seminar Series. (Descriptions and links to slides and audio follow below.) ------------------ eResearch Round Table Session: "Introduction to the Europeana Data Model: Framework & Requirements" Carlo Meghini, Institute of Information Science & Technology, Italy Audio: http://waterfall.lis.illinois.edu/dl/events/errt/errt_mar7_12.mp3 Elluminate recording: https://sas.elluminate.com/p.jnlp?psid=2012-03-07.1031.D.A9529D3539D58BF53A44972864FDBA.vcr&sid=407 The Europeana Data Model (EDM) is the data framework within which all Europeana data is ingested, managed, and published. This talk describes the fundamental parts of EDM and the EDM requirements that data providers need to meet to successfully produce EDM compatible data. The talk will culminate with a discussion of how EDM can accommodate collection-level representations. Bio: Carlo Meghini is a prime researcher at ISTI, working in the area of digital libraries and digital preservation. In the area of digital libraries, he has been involved in the DELOS Network of Excellence in Digital Libraries, contributing to the DELOS Reference Model for Digital Libraries; he participated in the FP6 Integrated Project BRICKS, aiming at developing a distributed Digital Library Management System, in the DL.org coordination action, and is involved in the making of Europeana since 2007, through the EDLnet, Europeana version 1.0, Europeana version 2.0 and ASSETS Best Practice Networks. In the area of digital preservation, he has been involved in the CASPAR project, an FP6 Integrated Project aiming at developing an OAIS-based architecture for preservation; he has also taught the OAIS Reference Model in several events organized by the CASPAR Project in conjunction with PLANETS and DPE Network of Excellence in Digital Preservation. For more information:http://www.nmis.isti.cnr.it/meghini/. Suggested readings: * Europeana documentation, including Primer, is available at: --> http://version1.europeana.eu/web/europeana-project/technicaldocuments/ * Paper: The Europeana Linked Open Data Pilot; Haslhofer, Bernhard and Antoine Issac. Proc Int'l Conf. on Dublin Core and Metadata Applications 2011. --> dcpapers.dublincore.org/index.php/pubs/article/view/3625 --------------- CIRSS Seminar: A Model for Digital Libraries and its Translation to RDF Carlo Meghini, Institute of Information Science & Technology, Italy With the advent of the Web, the traditional concept of library has undergone a profound change: from a collection of physical information resources (mostly books) to a collection of digital resources. Additionally, the notion of digital resource includes not only texts in digital form, but in general any kind of multimedia resources. In a traditional library, physical information resources are managed through well-understood manual procedures, whereas in a digital library digital resources are organized according to a data model, discovered through a query language and managed in a highly automated way. In this paper, we present a data model and query language for digital libraries supporting identification, structuring, metadata support, re-use and discovery of digital resources. The model that we propose is inspired by the Web and it is formalized as a first-order theory, certain models of which correspond to the notion of digital library. We demonstrate the feasibility of the model and its suitability for practical applications by providing a full translation of the model to RDF and of the query language to SPARQL. Bio: Carlo Meghini is a prime researcher at ISTI, working in the area of digital libraries and digital preservation. In the area of digital libraries, he has been involved in the DELOS Network of Excellence in Digital Libraries, contributing to the DELOS Reference Model for Digital Libraries; he participated in the FP6 Integrated Project BRICKS, aiming at developing a distributed Digital Library Management System, in the DL.org coordination action, and is involved in the making of Europeana since 2007, through the EDLnet, Europeana version 1.0, Europeana version 2.0 and ASSETS Best Practice Networks. In the area of digital preservation, he has been involved in the CASPAR project, an FP6 Integrated Project aiming at developing an OAIS-based architecture for preservation; he has also taught the OAIS Reference Model in several events organized by the CASPAR Project in conjunction with PLANETS and DPE Network of Excellence in Digital Preservation. For more information: http://www.nmis.isti.cnr.it/meghini/. Europeana and Linked Open Data Antoine Isaac, Vrije Universiteit Amsterdam Europeana recently launched a small animation advertising Linked Open Data to its network of partner cultural institutions: http://vimeo.com/36752317. Why? Why now? This talk will briefly introduce Linked Data technology, as especially seen from the perspective of cultural institutions like libraries. It will then present the particular perspective of Europeana, who has recently openly released part of its metadata as Linked Data -- see http://data.europeana.eu . How can we benefit? How can we contribute? A technical perspective on exchanging and enriching data will be presented, and complemented with a more strategical one, where Linked Data paradigm goes hand in hand with parallel efforts to make cultural data open (http://pro.europeana.eu/support-for-open-data). Bio: Antoine Isaac works as scientific coordinator for Europeana (http://www.europeana.eu/) and researcher in the Web and Media group (http://wiki.cs.vu.nl/web-media/) at the Vrije Universiteit Amsterdam. He has been investigating and promoting the use of Semantic Web and Linked Data technology in the Cultural Heritage environment since his PhD studies in Computer Science at the Universit Paris IV Sorbonne (http://www.paris-sorbonne.fr/en/) and the Institut national de l'audiovisuel (http://www.ina.fr/). His work focuses especially on the representation and interoperability of collections and their vocabularies.
  • 3/9/2012
      St��phane Goldstein, Research Information Network (RIN), has authored an article for LIBER Quarterly that summarizes the collaborative approaches fostered by the RIN Working Group on Information Handling and the DaMSSI project.    The DCC is a member of the working group and led the work of the DaMSSI project.    

    Read more

  • 3/5/2012
    Friday saw the launch of two new UMF-funded data management infrastructure projects: Dataflow and VIDaaS. DataFlow is a two-tier data management infrastructure that allows users to manage and store research data. VIDaaS (Virtual Infrastructure with Database as a Service) is software that enables researchers to build databases online as part of a cloud infrastructure.

    Read more

  • 3/1/2012
    The DCC has recently started working on a one-to-one basis with a number of Higher Education Institutions. The work takes the form of a series of Institutional Engagements which allow the DCC to provide tailored support to institutions and help them increase their research data management capability.

    Read more