This is the extended version of the abstract published in: Vujić, M. and Šalamon, D., eds.: Book of abstracts *of the National Open Data Conference. University of Zagreb, Faculty of Traffic and Transport Sciences, *Zagreb, 2021.
The rapid development of space technology and the increased interest in space exploration have resulted in the intensive observation of celestial bodies, mostly in the solar system, over the past decade with the prospect of an upward trend in the future. Large amounts of collected data on space bodies impose the need to develop the Spatial Data Infrastructure of Celestial Bodies at the general level to enable standardized organization and storage of these data, and their efficient use and exchange. To approach the development of such an infrastructure, it is necessary to investigate what data, as well as how and to what extent, are collected through space observation. It is also necessary to investigate how this data can be obtained. This paper provides an overview of planetary spatial data archives, data storage and retrieval methods, and their shortcomings in the context of easy search, download and interpretation of data, all with the aim of establishing Spatial Data Infrastructure of Celestial Bodies that would make space data more accessible to the public and non-planetary scientists.
Today, with the advancement of technology and the widespread use of the Internet a lot of information is easily accessible to everyone with Internet access, and spatial information is one of the most important elements to support decision-making in many disciplines. Organizations around the world spend millions of dollars each year on the production and use of spatial data
This paper provides an overview of planetary spatial data archives, data storage and retrieval methods, and their shortcomings in the context of easy search, download and interpretation of data, all with the aim of establishing SDICB that would make space data more accessible to the public and non-planetary scientists.
The first part of the paper explains planetary data and their importance and where this data can be found. The most frequently used archives and portals for access to space data are explained, and the ways of access to data as well as the main shortcomings of archives are investigated. Numerous interoperable initiatives have been launched to solve the existing problems of current ways of distributing space data. The mentioned initiatives serve as a basis for the development of the SDICB, the concept which is explained together with the future work of this research.
Since ancient times, mankind has always explored the stars and planets and used their observations for various applications by which they revolutionized perception and understanding of the world. With the advancement of technology, planetary science has greatly advanced, and planets have become objects of scientific research using in-situ and remote sensing methods. Our knowledge of the solar system is increasing with advances in technology and spacecraft are visiting more and more planets, satellites, comets, and asteroids. Planetary science is the science of studying planets, moons, and other space bodies in the solar system and space
Nowadays, planetary data increasingly converges with terrestrial geo-scientific visualizations and analyzes tools. To present this data as faithfully as possible, a spatial component is needed, and today more and more space spatial data are being collected. Space spatial data are all data with a spatial component, obtained by remote sensing methods, navigation methods, georeferencing of images or collected in-situ by rovers with enough spatial information to be projected on the space body. The space spatial data stored in various archives are not easy for use by non-professional scientists, so the appropriate metadata is provided, along with different types of image data to facilitate their use for the non-planetary community
The conversion of raw (unprocessed) space data into spatially enabled data involves great efforts in time and knowledge. For this to be achievable, missions must be developed and calibrated appropriately, and software and platforms for distribution of this data require continuous development and maintenance
The basic division of space spatial research data is into fundamental data, which include geodetic control networks, topographic data, and ortho-images, while the second group is framework data which include composite maps, planetary nomenclature, and geological maps. All these data have relevant information about the position in the reference coordinate system of the planet or body they are referring to. In the space archives we can find many other data, for example altimetric data for each space body, data on plasma and atmosphere of planets, orbital data, radar data, geometric shapes of planetary models and their parts, spectrometric data, etc. The most common types of data are geological data that can be found in all archives for each planet and other bodies. Geodetic and geophysical data include data on reference systems, planetary rotation and shape, topography, and gravitational models. It is also important that each data has corresponding metadata because without information on how they were taken, with which instrument and for what purpose, they would not be usable. The formats in which data is most often stored are VICAR (Video image access and retrieval) format, PDS (Planetary Data System), FITS (Flexible Image Transport System) and Raw formats such as JPEG.
For planetary data cartography, GIS, and remote sensing have an important role. Mapping of space bodies began with the invention of telescope, and with the advancement of technology, today the most important role in the mapping of space bodies has launch space missions that carry high-resolution cameras for data collection and based on this data cartographic maps are created. The biggest difference in the use of space to Earth data is the shape and size of the body being observed. International Astronomical Union (IAU) defines for all major bodies in our solar system geodetic parameters, which enables their mapping, and which are the basis for the data interoperability
The innovative nature of space missions creates new technologies and study techniques that will later benefit the public on Earth and contribute to the education of future generations. Space research also contributes to the public in the form of a growing understanding of the solar system and the importance of our planet in it
Collection and distribution of space data face several challenges. One of them is the standardization of cartographic methods and data which is critical for accurate and precise analysis. Today, this problem is of great interest since access to data is easy and accessible to everyone. But there are still no adequate ways to store this data as well as distribute it to users. One of the main problems is extremely decentralization of space community where each organization has its own archives and data sources, but also the standards and formats they use. The solution to these problems is the development of standards that would enable the interoperability of these data between different communities and that would create the foundations for the establishment of a SDICB. Therefore, it is necessary to create spatial data that will satisfy users and that will be easy to find, share and interpret.
Space research data is archived and made available to users through data archives and portals. Access and download to all of the data is free of charge to everyone. New missions increase the number of data every year, which does not make it easier to search and access the data.
The largest archive of space research data is NASA`s Planetary Data System (PDS),
Following the PDS, the European Space Agency has created the Planetary Science Archive (PSA). PSA is an archive of data from ESA`s space missions but also distributes data from some of NASA`s missions. PSA supports and uses NASA`s PDS3 / 4 standard for archiving the data, and data is distributed through a web data browser which is based on filtering data options by type, mission, and instrument. In addition to NASA, United States Geological Survey (USGS) has the largest database of space data. USGS has developed several web searches tools. Integrated Software for Imaging and Spectrometers (ISIS) is one of USGS`s portals which consists of software for downloading, processing, and calibrating radiometric and geometric data of space research. It supports the missions of NASA, ESA, ISRO and JAXA. USGS independently creates documentation and metadata for distributed products. Map-a-Planet (Astropedia) is also under the jurisdiction of the USGS and provides to users projected images and ready-to-use products. It supports standardized WMS web mapping services for all bodies, and data search is enabled through the map view. All data can be visualized before download. On this portal it is possible to download GIS-ready products for use in standard GIS tools. According to many research and surveys, Astropedia is the most user-friendly portal and the easiest to use, especially for non-planetary users. Imaging Node Annex provides users with spatial data and products derived from PDS data (mosaics, maps, SHP files, databases). For each product the link to the original publications and information about the quality of the data is provided. Portal creates own metadata for products that are provided to users. Astrogeology USGS is a Web portal that offers users to download WMS services for more than 30 different space bodies with over 100 images. Access to WMS services of this portal is possible directly through GIS applications ArcMap, QGIS and GDAL.
These are just some of the most used portals and tools for downloading and retrieval of space research data. Numerous space missions are developing their own portals as well as national space agencies. There are also several open tools to facilitate the search of PDS and PSA, specialized portals and software for image data, and tools for data conversion or product generation from raw data formats.
But despite the fact that there are many places where data can be searched and downloaded, users, especially non-planetary users, have a general problem when searching for the data: “Where to download data and how to search for it?”. Borden and Bishop
SHORTCOMINGS OF THE ARCHIVES
The survey by Borden and Bishop
The motivation to support common, interoperable data formats and standards in space research is not only to improve access to data and products but also to address the problem of distributing increasing amounts of data. Use of standards increases the reach of data, enables better visualization and analysis of data, and increases their efficiency
Basic for spatial analysis are geodetic coordinates, and coordinate systems and frameworks which define the precise position of an object in relation to the agreed origin
The USGS is a major provider of tools for processing and analyzing maps and other spatial data of NASA missions. The tools support Integrated Software for Imagers and Spectrometers (ISIS) and a specialized software package for image processing and processing of other space research data
Most of the data in the archives are stored in the original (raw) format of the mission instrument. For such data to be used for further analysis in GIS tools, they need to be georeferenced. Unfortunately, the PDS format is not widely recognized in GIS tools. Two formats recommended for use in space missions that are commonly used are GeoTIFF and GeoJPEG2000. GeoJPEG200 was approved by the PDS, and in 2008 it was used for the first time in Mars HiRISE mission. Another format used by the astronomical community, and whose use is also recommended for space exploration, is GeoFITS. The format is compatible with PDS standards and is supported by many open-source software tools and catalogs.
For web service interoperability, the consortium that defines standards is the Open Geospatial Consortium (OGC). Several space missions support WMS / WFS standards. This allows users to search and visualize data projected on a map in JPEG or PNG format. Of the currently published WMS/ WFS services, only some include support according to IAU recommended for coordinate systems. Also, the WMTS service is used, which enables faster delivery of map layers but is not as flexible as the previous two and cannot generate images at any scale. WCS / WCPS network services have also found application in space research
IAU recommendations do not cover other standards relevant to digital mappings, such as attribute feature representations, symbols, color scales, and metadata. Digital maps must use standards so consistent cartographic products can be developed. Attributes and symbols for digital maps of space bodies are defined in the Digital Cartographic Standard for Geological Map Symbolization
Most existing space data portals often include minimal metadata and therefore have limited search capabilities for external users. The methods defined by the OGC CSW standard can facilitate user access, so users do not need to build new search tools. One of the main advantages of using the OGC CSW standard is the ability to index one data portal into another. Products served by such mutually indexed portals must provide references to data creators and source data portals. Today, more and more initiatives are encouraging the use of OGC standards in space research. Well-structured and stored metadata are extremely important for achieving interoperability. PDS metadata, in which most of space data is archived, are not supported for use on widespread spatial data portals. Most geospatial portals require metadata as defined by the FGDC or ISO. Methods for converting PDS to FGDC / ISO metadata standards should be possible, especially because FGDC metadata standards require only a few minor add-ons to properly support space data
Nomenclature is also important for any cartographic product, providing context for research and analysis. The main institution for nomenclature is the IAU, which through the Gazette of the Planetary Nomenclature collects requests and publishes official nomenclature that is used in all missions. The official IAU nomenclature is publicly available for download in KML and WFS format.
Many initiatives to improve the availability of space data have been launched through volunteer communities and various private organizations. One of these initiatives is MPASIT, which provides recommendations for the development of a comprehensive planetary spatial data infrastructure. Europlanet initiative aims to connect the European space community and project of geological mapping of Mars, Mercury and the Moon was also launched within it. The VESPA initiative deals with the availability and distribution of space data from various scientific domains. Several initiatives address the standardization of space data archiving such as PlanetServer, OpenPlanetary initiative provides an online framework to help collaborate between different institutions in planetary mapping, and CARTO initiative is focused on web solutions for spatial visualization and data analysis of space research data.
One of the solutions to the problem of rapid increase in the amount of space research data is the establishment of an efficient SDI. The concept of SDI is widely applicable to any spatial data and is not limited to Earth data and allows data interoperability through policies and standards by defining mechanisms for data storage and access. In the paper of Laura et al., this concept applied to space data is called Planetary Spatial Data Infrastructure (PSDI)
The SDICB is an extension of the traditional terrestrial SDI that will allow standardized collection, management, and retrieval of spatial data from space exploration. Today, space data is stored in archives and portals and such archives are not user-centric, do not allow semantic data search, and are adjusted for space scientists and research. The existing archives, given their objectives and method of implementation, currently do not meet the main principles of the SDI. SDI must serve the wider community whose members do not need to be spatial data experts and who do not understand the intricacies of storing, retrieving, and using spatial data. Archives are technology-oriented and need to focus on simplifying data access and improving data usability. The main reasons why SDICB is needed in the space community is that its establishment would keep all data in one place, avoid duplication of data from different agencies and space research teams, harmonize the formats of data collected and achieve their interoperability, simplify access and downloading. Using these datasets would reduce the number of difficult-to-understand data access tools and allow other users outside the space community to access and use the data. The establishment of SDICB would increase user confidence in the interoperability and accuracy of the data, which would contribute to scientific research and decision-making.
Current methods of archiving data do not allow their use by non-professional users who are not involved in the data collection. There are several initiatives within NASA and other space agencies focused on increasing the availability of space research data to the public. Most space data users are unable to process and use raw data without investing much time and effort in understanding archived data. When accessing space data, there is a big problem of decentralization of the data, given that there are many places where data can be accessed and there is a big problem of data duplication. Insufficiently clear metadata, which does not adequately describe the data and does not provide good methods for searching and filtering data of interest, is also a big problem. SDICB would centralize data by establishing a single access point for spatial data of space exploration missions. Adopting recommendations from various initiatives to improve data availability and interoperability through a unique space research metadata catalog as shown in Figure 1 would provide standardized ways of accessing data for all users, make metadata understandable to all, and would allow flexible data access technologies.
Given the great challenges and high costs associated with collecting space data, it is not surprising that the development of SDICB was not the primary problem of the space community. However, by treating and preparing the collected data as multi-purpose infrastructure products
As with terrestrial implementations, the SDICB concept would have 5 basic components: policy, access networks, standards, people, and data. But each of these components must be further expanded to meet space data requirements. The extension refers to answering various questions related to data and other components, and the answers to these questions provide the current state of space spatial data management and identify subjects that need to be engaged. The list of knowledge collected in this way is ideal for identifying not only the current state of the spatial data management system but also for identifying strategic gaps that should be addressed during the establishment and creation of SDICB
After researching the ways of archiving space research data and their shortcomings, as well as research of various initiatives related to achieving interoperability in the space community, the SDICB concept is considered as the best solution to address these shortcomings. To define the framework for the establishment and implementation of such a concept, it is necessary to answer numerous questions and define all the elements for successful implementation. This includes establishing the definition of the SDICB with the main objectives, vision, and reasons for its establishment. The concept itself must include detailed studies on the needs and identification of users, stakeholders, and the legal basis (policies and standards) for establishment. It is necessary to solve the institutional and organizational structure (level of development and strategic plan) of the project establishment, to define the basic and thematic data, standards, and methods of collecting and storing relevant metadata. As part of the development of such a concept, a survey is currently being conducted to assess the current situation and user needs, and the results of which, along with additional research on the remaining elements of the concept, will serve in the future to establish guidelines for implementing the SDICB concept.
Today, there is a great increase in volume and art of space research data and the growing interest of scientists and the public in using this data. Data open and accessible to everyone does not make it easier to find and use it. The archives in which data are stored are intended for their long-term preservation and are focused mainly on data, not the users. To find and interpret data of interest requires too much effort and time, and searches are possible at the missions and instruments levels. Archives do not allow semantic data search, which indicates that there is a problem with insufficiently documented metadata. In addition, one of the main problems are data formats as well as their interoperability and use in standard tools. Over the past few years, several initiatives have emerged to standardize ways of collecting, storing, and distributing space research data to make them interoperable and accessible to a wide range of users. Each of these initiatives provides recommendations on how to store data and related metadata. One of the solutions to this burning problem in space community is the establishment of a SDICB that would effectively distribute data through an agreed set of standards based on recommendations, institutional cooperation agreements and policies. Successful implementation of such a concept would serve the wider community and provide an easy way to search, access and use data for all users, not just space scientists, as is the case with current ways of archiving data. This way of organizing data will allow maximum use of data and facilitate their use, especially for citizens. The scientific and research potential of archived data will increase, which will contribute to a better and easier understanding of, for example, the physics and dynamics of planets and other space bodies, and the understanding of physical and dynamic processes on Earth.
This research is part of TODO project that has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 857592.