Data Fabric
by Nasdaq Data Link
Enabling Rapid Data Deployment
Nasdaq Data Link powers data-driven decision-making for over
700K users around the world. Our robust, universal API is tailor-made
for financial institutions and a preferred method of data
discovery and ingestion for tens of thousands of organizations.
For the first time, we're giving our clients access to the same
technology and team that powers Nasdaq Data Link, enabling them
to ingest and deploy data within their organizations with greater
speed and efficiency so they can focus on their firm's core
value drivers.
Mapping the data deployment odyssey
According to a survey initiated by Nasdaq and conducted
by Wakefield Research, 93% of portfolio managers are
not fully satisfied with some aspect of their organization’s
data management capabilities.
Given how challenging the path from wanting data to deploying data
can be, this is an unsurprising state of affairs. From ingestion to
cleansing to productizing and finally to deployment, the journey of
data onboarding is perilous.
[Diagram: the data deployment journey]
Ingest: Source data → Connect to source & transfer → Parse & reformat data → Clean & QA → Normalize to standard format/symbology
Onboard: Document & catalogue → Load into data warehouse
Productize: Set up compute environment → Research & develop analytics → Build final ETL pipeline → Monitor & maintain delivery and access
Deploy: Portfolio managers, analysis reports, trading systems, applications
With Data Fabric by Nasdaq Data Link, our goal is to make the middle
component invisible to our clients, eliminating the burden of the intervening
steps between data selection and deployment. Like so:
[Diagram: Select data source → Data Fabric → Deploy to portfolio managers, analysis reports, trading systems and applications]
This document will zoom in on and break down the three super-categories
that define data deployment, Ingest, Onboard and Productize, and show
how the Data Fabric technology and team work within your organization
to make the entire process painless.
Finally, we'll review Share & Manage: the critical, easy-to-use,
proprietary tooling we've developed to make the ongoing administration
of data access, audit, governance, reporting and more a breeze.
Ingest & Unify
Investors and analysts require data that comes from myriad sources
in highly variable formats: streaming datasets, batches arriving at
different intervals, flat files downloaded from websites or on-premise
servers, public cloud solutions, Hadoop clusters and more. Each source
and format is unique, and handling it properly requires analysts with
experience across diverse data sources.
[Diagram of data sources: websites, cloud storage, Hadoop clusters, local servers]
Data Fabric's data ingestion team will unify the collection of tables
that make up the dataset. They then set up pipelines to blob storage,
SQL queries and automated scripts to collect the data into a staging
environment. Proprietary Data Fabric monitoring technology ensures
that data is captured accurately as it flows in from each source.
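To make the shape of this step concrete, here is a minimal sketch in Python. It is purely illustrative: the source names, fetch functions and staging structure are invented for the example and are not the actual Data Fabric pipeline.

```python
# Hypothetical sketch: each source gets a small fetch function, and the
# staging step records what arrived so monitoring can flag any source
# that delivered nothing.

def fetch_from_website():
    return [{"symbol": "AAPL", "close": 160.0}]  # placeholder rows

def fetch_from_cloud_storage():
    return [{"symbol": "MSFT", "close": 290.0}]  # placeholder rows

SOURCES = {
    "website": fetch_from_website,
    "cloud_storage": fetch_from_cloud_storage,
}

def ingest_to_staging():
    """Collect every source into one staging area and log row counts
    so a monitor can verify data is being captured from each source."""
    staging, counts = {}, {}
    for name, fetch in SOURCES.items():
        rows = fetch()
        staging[name] = rows
        counts[name] = len(rows)
    return staging, counts
```

The per-source row counts are what a monitoring layer would compare against expectations to catch a feed that silently stopped delivering.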
The Data Fabric team understands that financial data science depends on
comparable and consistent data. In addition to checking data for missing
values, our team will standardize schema mapping, entity identifiers and
time scales. The mess of sources becomes a time-series file ready for
further processing.
[Diagram: Unify. Raw data is often messy, inconsistent and filled with gaps. Our team of data scientists transforms raw data into an organized time series.]
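A toy example of what this unification amounts to, sketched with pandas. The two vendor feeds, their column names and symbology are invented for illustration; the real standardization rules are far more extensive.

```python
import pandas as pd

# Hypothetical sketch: two invented vendor feeds with different
# schemas, symbologies and date conventions.
feed_a = pd.DataFrame({
    "dt": ["2022-08-15", "2022-08-17"],
    "sym": ["AAPL US", "AAPL US"],
    "px_close": [155.0, 160.0],
})
feed_b = pd.DataFrame({
    "TradeDate": ["20220816"],
    "Ticker": ["AAPL"],
    "Close": [157.5],
})

# Map each vendor schema onto one standard schema and symbology.
std_a = feed_a.rename(columns={"dt": "date", "sym": "symbol",
                               "px_close": "close"})
std_a["symbol"] = std_a["symbol"].str.split().str[0]  # "AAPL US" -> "AAPL"
std_a["date"] = pd.to_datetime(std_a["date"])
std_b = feed_b.rename(columns={"TradeDate": "date", "Ticker": "symbol",
                               "Close": "close"})
std_b["date"] = pd.to_datetime(std_b["date"], format="%Y%m%d")

# One consistent time series, ready for gap checks and further QA.
unified = (pd.concat([std_a, std_b], ignore_index=True)
             .sort_values("date")
             .set_index("date"))
```

Once every source speaks the same schema, identifiers and time scale, missing-value checks and downstream analysis can treat the data as a single series.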
Onboard
Nasdaq Data Link has deployed thousands of datasets since its inception.
That kind of publishing volume is only possible with machine
intelligence that automates much of the process that would otherwise
require human intervention.
Our team of data scientists will deploy machine learning models to tag
relevant information in the dataset you wish to onboard. This can be content
to drive analysis, or metadata to help understand data lineage. Combined,
these create searchable and, most importantly, understandable data at the
end of the pipeline.
[Diagram: Tag. Data Fabric deploys advanced data science techniques to identify the components of a data table at scale. Example tags: 8/17/22 → Date; AAPL → Company; $160 → Price; Close price → Internal metadata of source; Hold → NLP text]
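The production models behind this tagging are proprietary; as a stand-in, a toy heuristic tagger below shows the idea of classifying raw cells into coarse content types. The patterns and labels are illustrative only.

```python
import re

def tag_value(value: str) -> str:
    """Toy stand-in for ML-based tagging: classify a raw cell into a
    coarse content type using simple patterns (illustrative only)."""
    if re.fullmatch(r"\d{1,2}/\d{1,2}/\d{2,4}", value):
        return "Date"
    if re.fullmatch(r"\$\d+(\.\d+)?", value):
        return "Price"
    if re.fullmatch(r"[A-Z]{1,5}", value):
        return "Company"
    return "NLP text"  # free text falls through to NLP handling

tags = {v: tag_value(v) for v in ["8/17/22", "AAPL", "$160", "Hold"]}
```

A real system would replace these regexes with learned models, but the output is the same kind of artifact: a tag per component that makes the table searchable and understandable downstream.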
Once tagging is complete, a catalogue, or index, of the data becomes
possible. Its purpose is to help current and new users understand what
data exists and how they can use it.
[Diagram: Catalogue. A factored catalogue allows users to understand what data exists and how they can use it.]

Field                      Description
ISIN                       Code to identify a security
Free cash flow             Cash available to the company
Interest coverage ratio    Ratio of ease of debt payment
Retail footfall tracking   Alternative data for activity
Data classification is an important step at this stage in our process,
especially as it pertains to auditing and permissioning potentially
sensitive components of a dataset. Data can be classified to help
control compliance, governance and privacy. By switching to field-level
classification instead of user-level classification, more data can be
shared with more people while ensuring sensitive information remains
secure.
[Diagram: Access. Public and open fields are available to all users; sensitive fields are granted per-user access; private fields are granted per-department access.]
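A minimal sketch of what field-level classification looks like in practice, assuming an invented set of fields and labels (none of these names come from Data Fabric): each column carries a sensitivity label, and a per-requester view is built instead of granting or denying the whole dataset.

```python
# Hypothetical field-level classification: columns, not whole
# datasets, carry sensitivity labels.
FIELD_CLASS = {
    "isin": "public",
    "free_cash_flow": "public",
    "client_positions": "sensitive",
    "desk_pnl": "private",
}

def visible_fields(user_clearances: set[str]) -> list[str]:
    """Return the columns a user may see given the classification
    levels they are cleared for ('public' is open to everyone)."""
    cleared = {"public"} | user_clearances
    return [f for f, c in FIELD_CLASS.items() if c in cleared]
```

Because access is decided per field, a user with no special clearance still sees the public columns of a dataset that also contains sensitive ones, which is exactly how more data reaches more people without exposing the sensitive parts.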
Productize
The traditional analysis pipeline has a number of components, each
requiring key decisions. When multiplied by the tens of thousands of
packages available, even small teams can end up with a set of fractured
services that don't support each other.
Set up compute environment
Fabric supports the most popular and proven tools for financial data
analysis, such as Python, R and Scala. With the dozens of IDEs and
compilers available for these languages, data scientists will
inevitably choose different setups than their colleagues. If libraries
or functions exist in one environment but not another, complications
follow: code that won't run in a different environment because of
missing functions, or erroneous outputs.
With Fabric, the IDE serves as an input, so you have the flexibility
to use the IDE with the best core functionality for you, and we'll
provide the relevant financial analysis packages and libraries.
Research and develop analytics
With tens of thousands of libraries available for analysis, from NumPy
and scikit-learn to Plotly and pandas, it can be difficult to choose
the most appropriate ones and build consensus around them. Rather than
dealing with separate libraries from separate authors (which can become
outdated and hard to maintain), our platform has preconfigured tools
for backtesting, calculating financial ratios, and charting stock
prices and fundamentals.
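The platform's preconfigured tooling is proprietary, but the ratios involved are standard finance formulas. Two examples, matching the catalogue fields mentioned earlier, sketched as plain functions:

```python
def interest_coverage_ratio(ebit: float, interest_expense: float) -> float:
    """Interest coverage ratio: how easily a company services its
    debt from operating earnings (EBIT / interest expense)."""
    return ebit / interest_expense

def free_cash_flow(operating_cash_flow: float, capex: float) -> float:
    """Free cash flow: cash available to the company after
    capital expenditures."""
    return operating_cash_flow - capex
```

Having one vetted implementation of each ratio, rather than every analyst's own version, is what keeps results comparable across a team.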
Manage Git repositories and compute services
Managing code changes and setting up compute services can be
time-consuming and costly. Your data scientists shouldn't have to worry
about job or workflow scheduling, configuring clusters, or how many
nodes, processor cores and how much RAM they need.
With Data Fabric, we've set up efficient compute services for dataframe
analysis, machine learning with TensorFlow and other financial
workloads. Your workflows scale efficiently thanks to Nasdaq's
partnership with Databricks.
Monitor & maintain delivery and access
Just like data coming in from many sources, good analysis will flow out to
different use cases. Building a delivery monitoring system to ensure data is
getting where it needs to be can be complex. Fabric has built an efficient
monitoring system with alerts, SLI metrics and dashboards to monitor
trends, identify issues and feed analysis to the right products and users.
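The core of such a monitoring system is comparing a service-level indicator against an objective and alerting on the gap. A minimal sketch (the metric, the 99% objective and the function names are assumptions for illustration, not Data Fabric's actual SLIs):

```python
def delivery_sli(delivered: int, expected: int) -> float:
    """Service-level indicator: the fraction of expected data
    deliveries that actually arrived in the measurement window."""
    return delivered / expected if expected else 1.0

def should_alert(sli: float, objective: float = 0.99) -> bool:
    """Fire an alert when the indicator drops below the objective."""
    return sli < objective
```

Dashboards then plot the SLI over time so trends are visible before the objective is breached.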
Share & Manage
Beyond data ingestion and deployment, Data Fabric comprises a multitude
of features that span the full data lifecycle, from lineage to usage,
and ultimately allow for the efficient maintenance of data delivery
within an organization.
It starts with data discovery after a dataset has already been
deployed: a virtual catalogue (with an interface not unlike Nasdaq
Data Link's) ensures that data is easily found. For a new investment
manager or researcher joining an organization, knowing where to go to
see all of its data saves countless hours otherwise spent simply
finding out what's available. The alternative is building yet another
system that requires its own maintenance.
With all of the data now centralized on a single platform,
administering access to it is another key strength of Data Fabric: the
team responsible for deciding who has access to which dataset can
easily provide and control the amount and depth of permission.
Centralizing administration also allows the organization to
automatically track and record granular usage of a dataset for audit,
compliance and governance purposes.
Get value from your data
faster with Data Fabric
data.nasdaq.com/datafabric
REQUEST A CUSTOM DEMONSTRATION
Nasdaq Data Link
[email protected]