{"action":"create","ckan_id":null,"date_created":"Sat, 04 Apr 2026 22:12:53 GMT","date_finished":null,"harvest_job_id":"7fe82130-bb86-4c0a-80d3-ed1946dab111","harvest_source_id":"3dddf3ae-84a3-4731-88b4-6a0b57d0e503","id":"34595362-bae9-4d28-893c-fbc85c258c24","identifier":"https://datainventory.usbr.gov/rise/item/128581","parent_identifier":null,"source_hash":"4be5c1fc9442b34574c1ec15aed96b9cb14be6b8032944a662545ff66ab04234","source_raw":"{\"accessLevel\": \"public\", \"accrualPeriodicity\": \"irregular\", \"bureauCode\": [\"010:10\"], \"contactPoint\": {\"@type\": \"vcard:Contact\", \"fn\": \"RISE Team\", \"hasEmail\": \"mailto:data@usbr.gov\"}, \"description\": \"The Reclamation Research and Development Office funded an evaluation of file formats for large datasets to use in RISE through the Science & Technology Program. A team of Reclamation scientific and information technology (IT) subject matter experts evaluated multiple file formats commonly utilized for scientific data through literature review and independent benchmarks. The network Common Data Form (netCDF) and Zarr formats were identified as open-source options that could meet a variety of Reclamation use cases. The formats allow for metadata, data compression, subsetting, and appending in a single file using an efficient binary format. Additionally, the Zarr format is optimized for cloud storage applications. While support of both formats would provide the most flexibility, the maturity of the netCDF format led to its prioritization as the preferred RISE file format for large datasets. \\r\\n\\r\\nThis report documents the evaluation and selection of large data file formats for the RISE platform. Additionally, a preliminary list of identified changes to the RISE platform needed to support the netCDF format is provided. The intent is to frame future RISE development by providing a roadmap to support large datasets within the platform. \", \"distribution\": [{\"@type\": \"dcat:Distribution\", \"accessURL\": \"https://data.usbr.gov/catalog/8002/item/128581\", \"description\": \"Landing page for \\\"S&T Project 22041 Final Report: Evaluation of file formats for storage and transfer of large datasets in the RISE platform \\\"\", \"mediaType\": \"text/html\", \"title\": \"RISE Item Details Page URL for \\\"S&T Project 22041 Final Report: Evaluation of file formats for storage and transfer of large datasets in the RISE platform \\\"\"}, {\"@type\": \"dcat:Distribution\", \"description\": \"\\\"S&T Project 22041 Final Report: Evaluation of file formats for storage and transfer of large datasets in the RISE platform \\\" as a PDF file\", \"downloadURL\": \"https://data.usbr.gov/rise/content-rise-public/rise/catalog-item/binary/2023-12-31 rise report final.pdf\", \"mediaType\": \"application/pdf\", \"title\": \"PDF File for \\\"S&T Project 22041 Final Report: Evaluation of file formats for storage and transfer of large datasets in the RISE platform \\\"\"}], \"identifier\": \"https://datainventory.usbr.gov/rise/item/128581\", \"keyword\": [\"Modeling\", \"Open Data\", \"Storage\"], \"landingPage\": \"https://data.usbr.gov/catalog/8002/item/128581\", \"modified\": \"2024-01-05T19:19:09Z\", \"publisher\": {\"@type\": \"org:Organization\", \"name\": \"Bureau of Reclamation\"}, \"spatial\": \"{\\\"type\\\":\\\"Polygon\\\",\\\"coordinates\\\":[[[-180,-90],[-180,90],[180,90],[180,-90],[-180,-90]]]}\", \"title\": \"S&T Project 22041 Final Report: Evaluation of file formats for storage and transfer of large datasets in the RISE platform \"}","source_transform":null,"status":"error"}
