Thursday, February 25, 2021
NikolaNews

TileDB introduces canonical database storage format

July 22, 2020
in Big Data

Ever since the revelation that not all data can be neatly stored in rows and columns, it seems that barely a day goes by without the emergence of yet another new database with its own query engine and unique table or file format. TileDB has emerged to, in effect, scream “Stop the insanity” with its quest to establish arrays as a new, universal storage format.

Unlike most database CEOs, TileDB founder Stavros Papadopoulos comes from the scientific community rather than the technology community. What eventually became TileDB grew out of yet another Michael Stonebraker MIT project, SciDB, which offered a database engine suited to research scientists because of its array structure and is now commercially available as Paradigm4. Because the data is not force-fit into columns and rows, it can represent almost any kind of data structure, and commercially it has been used to build multi-dimensional arrays that bear some resemblance to the early generation of denormalized MOLAP databases.

But Papadopoulos identified one key drawback to SciDB: it could not handle data sparsity very well. Sparsity is where many columns are empty or null, a scenario that is quite common for genomic data sets focusing on how species or individuals are differentiated from one another; for people, the typical deviation across the human genome is barely 0.1%. Theoretically, you could store all the redundant data, but that would be a huge waste of resources; as a result, most genomic data sets are highly sparse.
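
To make the sparsity point concrete, here is a minimal Python sketch of the general idea of storing only the cells that differ from a default value. It is not TileDB's or SciDB's actual storage format; the matrix, the `to_sparse` and `read_cell` helpers, and the convention that 0 means "same as the reference" are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

Coord = Tuple[int, int]


def to_sparse(dense: List[List[int]], fill: int = 0) -> Dict[Coord, int]:
    """Keep only the cells that differ from the fill value, keyed by (row, col)."""
    return {
        (r, c): value
        for r, row in enumerate(dense)
        for c, value in enumerate(row)
        if value != fill
    }


def read_cell(sparse: Dict[Coord, int], coord: Coord, fill: int = 0) -> int:
    """Cells that were never stored are reported as the fill value."""
    return sparse.get(coord, fill)


# Hypothetical matrix: 4 samples x 8 positions; 0 means "same as the reference".
dense_matrix = [
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 1, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 2, 0, 0],
]

sparse_cells = to_sparse(dense_matrix)
dense_cell_count = sum(len(row) for row in dense_matrix)
stored_cell_count = len(sparse_cells)
```

In this toy case only 2 of the 32 cells need to be stored, and any cell that was skipped can still be read back as the default.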

So Papadopoulos left the ivory tower at MIT and, initially backed with seed funding from Intel Capital, started TileDB. It picks up where SciDB leaves off by building sparsity into its optimizations and, unlike most databases, concentrates entirely on data storage and management while leaving the compute/query engine pluggable. That’s the reverse of databases like MySQL and MariaDB, which feature a common compute tier but make the storage engine pluggable. So, for instance, TileDB versions data, supports “time traveling” (we presume, through snapshots), and handles housekeeping tasks such as access control, logging, and managing metadata.
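
As a rough illustration of what "time traveling" over versioned data can look like, the following Python sketch keeps every write as an immutable, timestamped batch and rebuilds a view by replaying the batches up to a chosen time. This is only one plausible approach under stated assumptions: `VersionedStore`, `write`, and `read` are hypothetical names, not TileDB's API, and nothing here confirms how TileDB actually implements the feature.

```python
from typing import Dict, List, Optional, Tuple

Coord = Tuple[int, int]


class VersionedStore:
    """Hypothetical store that never overwrites data; each write is a timestamped batch."""

    def __init__(self) -> None:
        self._batches: List[Tuple[int, Dict[Coord, float]]] = []

    def write(self, timestamp: int, cells: Dict[Coord, float]) -> None:
        # Copy the cells so later changes by the caller cannot alter history.
        self._batches.append((timestamp, dict(cells)))
        self._batches.sort(key=lambda batch: batch[0])

    def read(self, at: Optional[int] = None) -> Dict[Coord, float]:
        """Replay batches in time order; stop at the requested timestamp, if any."""
        view: Dict[Coord, float] = {}
        for timestamp, cells in self._batches:
            if at is not None and timestamp > at:
                break
            view.update(cells)  # newer values win over older ones
        return view


history = VersionedStore()
history.write(1, {(0, 0): 1.0, (0, 1): 2.0})
history.write(2, {(0, 1): 5.0})

as_of_first_write = history.read(at=1)
latest = history.read()
```

Reading with `at=1` returns the array as it looked after the first write, while reading without a timestamp returns the latest state.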

Yet in some ways, TileDB follows a design pattern that is familiar in the cloud database world, where the storage engine is common but exposed through different APIs. Microsoft Cosmos DB is the best known public example of this approach, having a core storage tier with APIs for SQL, JSON, graph, and wide column. Additionally, Amazon Aurora and Keyspaces, along with Google Cloud Spanner and Cloud Datastore, all run against storage engines via APIs.
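
The pattern of a common storage tier with interchangeable compute on top can be sketched in a few lines of Python. Everything below is hypothetical and intended only to show the shape of the design: `ArrayStore`, `slice_engine`, and `aggregate_engine` are invented names and do not correspond to the API of TileDB, Cosmos DB, or any other product mentioned here.

```python
from typing import Dict, Iterator, List, Tuple

Coord = Tuple[int, int]
Cell = Tuple[Coord, float]


class ArrayStore:
    """Hypothetical storage tier: it owns the data and knows nothing about queries."""

    def __init__(self, cells: Dict[Coord, float]) -> None:
        self._cells = dict(cells)

    def scan(self) -> Iterator[Cell]:
        # The only contract offered to engines: cells in coordinate order.
        return iter(sorted(self._cells.items()))


def slice_engine(store: ArrayStore, rows: range) -> List[Cell]:
    """One pluggable engine: return the cells whose row index falls in `rows`."""
    return [(coord, value) for coord, value in store.scan() if coord[0] in rows]


def aggregate_engine(store: ArrayStore) -> float:
    """A second, unrelated engine reading the same storage: sum all stored values."""
    return sum(value for _, value in store.scan())


shared_store = ArrayStore({(0, 0): 1.0, (1, 2): 4.0, (3, 5): 2.5})

sliced = slice_engine(shared_store, range(1, 4))
total = aggregate_engine(shared_store)
```

Both engines depend only on `scan`, so either can be swapped out or joined by a third without touching how the data is stored.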

TileDB offers two products: TileDB Embedded, an open-source, cloud-native storage library for multi-dimensional arrays, and TileDB Cloud, a serverless SaaS offering for sharing data and code and enabling efficient computations, which currently runs on AWS and uses S3 for physical storage.

By leveraging cloud storage, abstracting the compute and query engine, and designing its cloud offering to be serverless, TileDB is promoting its ability to scale. Having recently announced $15 million in Series A funding, the company is initially targeting use cases in genomics and geospatial.

Credit: ZDNet

© 2019 NikolaNews.com - Global Tech Updates
