Accession Number:

ADA595983

Title:

Big Data Quality Case Study Preliminary Findings, U.S. Army MEDCOM MODS

Descriptive Note:

Technical rept.

Corporate Author:

MITRE CORP BEDFORD MA

Report Date:

2013-09-01

Pagination or Media Count:

55.0

Abstract:

A set of four case studies related to data quality in the context of the management and use of Big Data are being performed and reported separately these will also be compiled into a summary overview report. The report herein documents one of those four cases studies. The purpose of this document is to present information about the various data quality issues related to the design, implementation and operation of a specific data initiative, the U.S. Armys Medical Command MEDCOM Medical Operational Data System MODS project. While MODS is not currently a Big Data initiative, potential future Big Data requirements under consideration in the areas of geospatial data, document and records data, and textual data could easily move MODS into the realm of Big Data. Each of these areas has its own data quality issues that must be considered. By better understanding the data quality issues in these Big Data areas of growth, we hope to explore specific differences in the nature and type of Big Data quality problems from what is typically experienced in traditionally sized data sets. This understanding should facilitate the acquisition of the MODS data warehouse though improvements in the requirements and downstream design efforts. It should also enable the crafting of better strategies and tools for profiling, measurement, assessment, and action processing of Big Data Quality problems.

Subject Categories:

  • Information Science
  • Computer Programming and Software

Distribution Statement:

APPROVED FOR PUBLIC RELEASE