Archive Data Repository
'Archive Data Repository' collects data and data catalogs from one or more data sources and stores the data in a focused repository that is suited to a particular set of ITS data users. It includes capabilities for performing quality checks on the incoming data, error notification, and archive to archive coordination. It includes the capability to define a data registry that allows registration of data identifiers or data definitions for interoperable use throughout a region. It supports a broad range of implementations, ranging from simple data marts that collect a focused set of data and serve a particular user community to large-scale data warehouses that collect, integrate, and summarize transportation data from multiple sources and serve a broad array of users within a region. Repositories may be established to support operations planning, performance monitoring and management, and policy and investment decisions.
- The center shall collect data from data distribution systems and other data sources.
- The center shall respond to requests from the administrator interface function to manage center-sourced data collection.
- The center shall collect data from centers.
- The center shall collect data catalogs from one or more data sources. A catalog describes the data contained in the collection of archived data and may include descriptions of the schema or structure of the data, a description of the contents of the data; e.g., time range of entries, number of entries; or a sample of the data (e. g. a thumbnail).
- The center shall store collected data in an information repository.
- The center shall perform quality checks on collected data.
- The center shall notify the system operator of errors related to data collection, analysis and archival.
- The center shall respond to requests from the administrator interface function to manage the archive data.
- The center shall include capabilities for archive to archive coordination.
- The center shall provide the capability to execute methods on the incoming data such as cleansing, summarizations, aggregations, or transformations applied to the data before it is stored in the archive.
- The center shall respond to requests for archive data from archive data users (centers, field devices).
- The center shall provide capabilities to access "in-place" data from geographically dispersed archives. These capabilities may include analysis, data fusion, or data mining.
- The center shall provide the specialized publishing, directory services, and transaction management functions associated with coordinating remote archives.
- The center shall support the collection of archived data from other archives on an as-needed basis. (This minimizes the need to duplicate the comprehensive set of data from the remote archives in the local data warehouse.)
- The center shall use data collected from different archives to build a set of global schema including the data archive definitions for the local archive plus any archives known to the local archive.