RDM

Holistic approaches to Institutional Repositories at Open Repositories 2018

Open Repositories 2018

Open Repositories, an annual international conference that brings together users, developers, and librarians to discuss open digital repository platforms for institutional data and scholarship, was held in Bozeman, Montana from June 4 - 8. Anna Sackmann, a librarian and RDM consultant, attended to learn more about how other institutions are incorporating repository deposit into the researcher workflow and to present related local efforts at UC Berkeley.

Guides to making research software reproducible, citable, and well-documented

Computational research workflow diagram

Preserving and maintaining research software is a challenge to researchers and academic libraries. In my role as a CLIR postdoctoral fellow in software curation, I recently discussed the technical challenges of preserving and maintaining research software in a UC Berkeley Library Update, Research Software in Practice, posted in May.

UC DLFx conference tackles the new frontier of data management

Vessela Y Ensberg (UC Davis), Emily Lin (UC Merced), Ho Jung Yoo (UC San Diego), Amy Neeser (UC Berkeley) at UCDLFx 2018

The inaugural 2018 University of California Digital Library Forum (UC DLFx), "Building the UC Digital Library: Theory and Practice," took place February 27th to March 1st at UC Riverside. The conference brought together librarians, digital technology experts, educators, policy-makers, and research support staff from the UC campuses and the California Digital Library. Participants discussed and explored how libraries and research support departments are engaging with the constantly changing and challenging world of data and digital scholarship.

Using rclone to transfer data to bDrive

RClone Browser screen shot

The bDrive repository offers everyone at UC Berkeley unlimited storage, strong search capabilities, and mobile access. This storage is an important data management resource for research teams. The standard web client, however, does not always work well when dealing with very large files, many files, or deep folder structures. The web client’s connection is slow, and can disconnect in the midst of a lengthy, time-consuming transfer.

IPython notebook available to ease data transfer between Savio and Box

Screenshot from the iPython notebook configured for Box access,TransferFilesFromBoxToSavioScratch.ipynb

For researchers running computation on the Savio high-performance compute cluster, data transfer can be a challenge. A new IPython notebook simplifies data transfer from the free Box collaboration platform to a Savio user’s scratch folder, and provides a template for users to develop their own algorithms that analyze data stored in Box.

Migrating half a million Hearst Museum images to Box

Card catalog images, representative of 527,000 similar images migrated to Box

The Phoebe A. Hearst Museum of Anthropology (PAHMA) recently found itself with over half a million digital catalog card images that are in active use, but needed to be duplicated in order to preserve them in a redundant, reliable archive. Copying a few hundred, or even a few thousand files is a relatively straightforward task. Assuring that 527,000 files are successfully copied in a reasonable period of time, without requiring constant attention, is trickier. Research IT worked with PAHMA’s Dr.

Pages