Understanding the Data Reference Model Simplified 101

Before FEAF was introduced, different government agencies performed their functions in disparate ways and used all kinds of data formats to store and process data. This led to problems in data sharing as well as coordination between the agencies when their paths crossed or quick coordination required between them.

FEAF aims to facilitate shared development of common processes and information, among government agencies, working within the US federal structure.

In this post you will learn the Data Reference Model a.k.a DRM, as specified by the US federal enterprise architecture framework (FEAF).

Table of Contents

What is Federal Enterprise Architecture Framework?

FEAF can be further partitioned into 5 functional parts:

Business Reference Model (BRM): Answers questions like Who does what, and how/when/why. The components reference model (CRM): The software/hardware components that are used to execute the BRM.

Technical Reference Model (TRM): The Computer/Communications technology and standards that support above three layers.
Data Reference Model(DRM): Defines standard ways to describe all the information/data that will be used by the government agency to execute its duties and programs.
Performance Reference Model (PRM): Defines standard ways to measure the value delivered by above processes/architectures.
Security Reference Model (SRM): which defines how security should be defined and maintained across organizations and while the data they use is stored or is under transit(shared).

Understanding Data Reference Model

Data Reference Model, falls in the Data architecture partition.

A detailed technical discussion of all the aspected of the Data Reference Model would be quite lengthy and out of scope for this article, but you will learn its salient features and their importance.

“Data Reference Model” refers to organizing and standardizing business information, with a view to gain uniformity, ease sharing and avoid ambiguity. The Data Reference Model describes artifacts which can be generated from the data architectures of government agencies and promotes uniform data management practices.

Data Reference Model aims to describe, categorize and easily share data.

Therefore, its further divided into 3 standardization areas.

Data Reference Model: Data Description
Data Reference Model: Data Context
Data Reference Model: Data Sharing

Data Reference Model: Data Description

Provides a means to uniformly describe data, with an aim to ease its discovery and sharing. The aim here is to ensure that there are minimal or no discrepancies between data with similar context. It ensures that the data has proper syntax and semantics and each data resource is meaningfully identified. It is further ensured that one must be able to quickly and accurately identify desired data, reuse data whenever needed and data artifacts are harmonized for easy comparison. Data description aims to eradicate any ambiguities in data interpretation, ensure that data schemas are accompanied by their respective data dictionaries, and incremental changes maintain the existing standards.

Data Reference Model: Data Context

Adds context and classification to data and data files, to facilitate discovery of data. It has well defined taxonomies to enable the clear identification of data assets that pertain to a community of interest.
This layer defines some key concepts to execute its purpose, some of them are:-

Data Assets: It’s a managed container for data, e.g. a database or document repository
Data Steward: A person responsible for managing a Data Asset
Relationships: as the name suggests, its a relationship between two topics, which is clearly defined here.
Topic: A category within the taxonomy is called a Topic here, its often synonymous with “node”. A Topic could further categorize a Data asset or a Digital Data Resource or a Query Point.

As an example, let’s consider a stack of feedback forms filled by users, pertaining to different products.

The Data asset is the form and the responses recorded for each question.

The CIO or Database administrator could be the Data Steward.

If a user can full up multiple forms, there exists a many-to-one relationship between feedback and users.

The topics here could be the products or the whole feedback exercise.

Data Reference Model: Data Sharing

Once the data has been properly described and contextualized, a well defined protocol to request data, grant access, exchange data through well defined fixed/recurring transactions between parties is needed.

Without sharing data, no cooperation or communication is possible between agencies. This layer/area exists for precisely the above function. It takes help from the Data Description layer to get a uniform definition of Exchange Packages and Query Points , and from the Data

Context layer it borrows the categorization of data exchange Packets and discovery of Query Points.

Then it provides architectural patterns and mechanisms to securely share data.

The Data-Supplier-to-Consumer-Matrix is an important construct defined in this layer.

Data Reference Model: data supplier to consumer matrix

Hevo is the only real-time ELT No-code Data Pipeline platform that cost-effectively automates data pipelines that are flexible to your needs. With integration with 150+ Data Sources (40+ free sources), we help you not only export data from sources & load data to the destinations but also transform & enrich your data, & make it analysis-ready.

Start for free now!

Get Started with Hevo for Free

Guidelines, Goals and Impacts of Data Reference Models

There are prescribed guidelines to Extract, Transform, Load (ETL) data, Data Publication, Entity/Relationship Extraction (Unstructured documents to structured documents or structured data objects), Document Translation, Contextual and Structural Awareness, Content Search and Discovery Services, Retrieval Services, Subscription and Notification Services etc.

Further, the goals of the Data Reference Model are:

Increase the visibility and availability of data and related data artifacts.
Enhance and promote information sharing.
Facilitate harmonization within and across COIs to form common data entities that support sharing.
Increasing the relevance and reuse of data/data artifacts via consistent taxonomical classification.

Before you move ahead, you must understand that the Data Reference Model is an abstract framework, it just provides foundational principles of how things should be done, and does not force any methodology/technology/implementation of its followers.

Agencies that implement Data Reference Model can choose multiple implementation approaches, methodologies and technologies; based on their budget/circumstances, it’s allowed as long as they remain consistent with the foundational principles of the Data Reference Model.

For example, for data exchange and transmission, agencies can choose to implement their part of the picture via SOAP ( Simple Object Access Protocol, EDI based) OR via XML( eXtensible Markup Language, World Wide Web Consortium based) ; if they are consistent with the specs of the Data Reference Model, the information exchange and interplay can work smoothly.

Below is how the 3 layers of Data Reference Model influence the processes within an agency.

Security and Privacy in Data Reference Model

All the 3 functional areas of Data Reference Model comply with Security and privacy considerations.

Security here defines the methods to protect data from unauthorized access/disclosure/modification/deletion etc., and applies to data at rest as well in transit.

Privacy here means divulging only the minimum necessary and allowed information when data is being collected., transmitted or stored. Not only individual privacy but moral and ethical concerns are addressed well by Data Reference Model.

For different types of data/usage, there are different standards and legislations prescribed by the federal and state governments. Data Reference Model complies to the following policies/legislations:-

National Institute of Standards and Technology (NIST) FIPS: FIPS 199 provides standards for the security categorization of federal information and information systems
Federal Information Security Management Act (FISMA): Part of the e-Governance act, FISMA provides a comprehensive framework for ensuring the effectiveness of information security controls over information resources
OMB Circular A-11 (Section 31-8): This Circular specifies the management improvement initiatives and policies to be implemented by agencies to ensure security and privacy.
E-Government Act of 2002: Among other things, this act mandates that OMB issue guidance to agencies on implementing the privacy provisions of the E-Government Act
NIST 800-60

Conclusion

You have learned what the Data Reference Model or Data Reference Model (as a part of FEAF) is. Some of its constituents and salient features, and how it enables different government agencies to co-operate and share relevant data, to speed up processes and save the taxpayers’ money.

Also you have discussed how it inter-operates with other federal policies and standards, to ensure uniform data storage and sharing, while keeping security and privacy as paramount concerns.

However, as a Developer, extracting complex data from a diverse set of data sources like Databases, CRMs, Project management Tools, Streaming Services, and Marketing Platforms to your Database can seem to be quite challenging. If you are from non-technical background or are new in the game of data warehouse and analytics, Hevo Data can help!

Visit our Website to Explore Hevo

Hevo Data will automate your data transfer process, hence allowing you to focus on other aspects of your business like Analytics, Customer Management, etc. This platform allows you to transfer data from 150+ multiple sources to Cloud-based Data Warehouses like Snowflake, Google BigQuery, Amazon Redshift, etc. It will provide you with a hassle-free experience and make your work life much easier.

Want to take Hevo for a spin? Sign Up for a 14-day free trial and experience the feature-rich Hevo suite first hand.

You can also have a look at our unbeatable pricing that will help you choose the right plan for your business needs!

Frequently Asked Questions

1. What is a reference model example?

A reference model is a conceptual framework that defines the layers or components involved in a particular system, often in the context of network architecture or data management.

2. What is an example of a data reference?

A data reference involves identifying or linking data points or data sources within a system. It provides context and ensures that data is used accurately and consistently.

3. What is the data and information reference model?

The Data and Information Reference Model (DIRM) is a conceptual framework used to organize and manage data and information within an organization. It provides a structured approach to defining how data is collected, stored, processed, and utilized.

Pratik Dwivedi Technical Content Writer, Hevo Data

Pratik Dwivedi is a seasoned expert in data analytics, machine learning, AI, big data, and business intelligence. With over 18 years of experience in system analysis, design, and implementation, including 8 years in a Techno-Managerial role, he has successfully managed international clients and led teams on various projects. Pratik is passionate about creating engaging content that educates and inspires, leveraging his extensive technical and managerial expertise.