Friday, February 26, 2021
  • Setup menu at Appearance » Menus and assign menu to Top Bar Navigation
Advertisement
  • AI Development
    • Artificial Intelligence
    • Machine Learning
    • Neural Networks
    • Learn to Code
  • Data
    • Blockchain
    • Big Data
    • Data Science
  • IT Security
    • Internet Privacy
    • Internet Security
  • Marketing
    • Digital Marketing
    • Marketing Technology
  • Technology Companies
  • Crypto News
No Result
View All Result
NikolaNews
  • AI Development
    • Artificial Intelligence
    • Machine Learning
    • Neural Networks
    • Learn to Code
  • Data
    • Blockchain
    • Big Data
    • Data Science
  • IT Security
    • Internet Privacy
    • Internet Security
  • Marketing
    • Digital Marketing
    • Marketing Technology
  • Technology Companies
  • Crypto News
No Result
View All Result
NikolaNews
No Result
View All Result
Home Big Data

For CockroachDB, transactions first, then analytics is a feature — not a bug

February 27, 2019
in Big Data
For CockroachDB, transactions first, then analytics is a feature — not a bug
586
SHARES
3.3k
VIEWS
Share on FacebookShare on Twitter

Credit: ZDnet

Cloud native databases. Serverless databases. However you want to call them, there’s a new breed of databases on the rise. One that promises automatic scalability on a global scale: No more toiling over configuration, management, replication and the like, just spin some instances in the cloud and go.

You might also like

DataStax Astra goes serverless | ZDNet

Off-chain reporting: Toward a new general purpose secure compute framework by Chainlink

Cutting-edge Katana Graph scores $28.5 million Series A Led by Intel Capital

Also: Open source AI chips making Green Waves: Bringing energy efficiency to IoT architecture

But although data is naturally gravitating to the cloud, not everyone is willing and able to move all data there. So databases these days are also increasingly expected to be able to handle workloads seamlessly both on premise and across a multitude of clouds.

And then, there’s also what by now seems like an old dilemma: to SQL, or to NoSQL? While upending the traditional design of relational databases has brought benefits in terms of scalability, replacing SQL is not necessarily something people want.

So how does one combine SQL, cloud native, multi-cloud, and hybrid cloud?

Multi-cloud, hybrid cloud, meet SQL

Unsurprisingly, there are a few cloud native database offerings around from cloud vendors. Some of them, like Azure CosmosDB, Google Spanner, and AWS Aurora, also support SQL. Clearly, none of them is multi-cloud. 

Also: 2019 will be another ‘Year of the Graph’: OpenCorporates is evidence No. 1

There are also a few multi-cloud databases around, the likes of DataStax Enterprise or MongoDB, making a play to capitalize on this strength, and adding serverless features. Being open source is a common trait among such offerings. They typically do not have SQL support though.

But are there options that are cloud native, support multi-cloud and hybrid cloud, SQL, and open source? A few, and CockroachDB is notable among them.

If you’re wondering what’s with the name, it’s a nod to resilience, something for which cockroaches are notorious for. Cockroach Labs, home of open source CockroachDB, was founded in 2015 by ex-Googlers Spencer Kimball, Peter Mattis, and Ben Darnell. While at Google, they had all used Bigtable and were acquainted with its successor, Spanner. Then, they set out to build something that can do what Spanner can, and more. 

Being resilient is where CockroachDB got its name from. But there’s more in this database that’s interesting besides the name.

ZDNet had a Q&A with Kimball, Cockroach Labs CEO, to discuss where they are in their journey, and what’s coming next. Since 2015m Cockroach Labs has grown to almost 100 employees, moved to a new NYC office, and opened three new offices in Seattle, Boston, and San Francisco, and raised a total of $53.5M over three rounds.

Kimball said they spent much of the first three years architecting and implementing the core product, with roughly 80 percent of employees focused on R&D. More recently, they have built out customer support, marketing, people ops, and sales teams, and are now closer to 50 percent of headcount devoted to R&D. 

These all sound like signs of growing up. But where does CockroachDB stand compared to the competition? Kimball thinks it fundamentally comes down to the capabilities of a geo-distributed SQL RDBMS, but offered in a way that provides flexibility to customers who either can’t or don’t want to go all-in and embrace a proprietary offering from one cloud vendor.

Geo-distribution for the win

But while differentiation from solutions offered by cloud and NoSQL vendors is clear, CockroachDB is not the only cloud-native, multi/hybrid cloud, SQL game in town. What sets it apart, according to Kimball, is geo-distribution: 

“We’ve spent a lot of time implementing CockroachDB from the ground up to provide truly geo-distributed SQL. More recent entrants to the cloud-native SQL market are either not geo-distributed (TiDB, Citus), or the SQL aspect is a monolithic head that’s been affixed to a distributed body (Yugabyte, FoundationDB, Aurora).

Geo-distributed yields two fundamental advantages: resilience, which can tolerate datacenter and even region-level failures, and data domiciling, which can keep data close to the customer for latency and privacy. Of the big vendors, only Spanner and Aurora provide the same resilience model, though Aurora’s is limited to a single region. No other database vendors yet provide the data domiciling capabilities which CockroachDB offers.”

webui-cluster-overview-cockroachdb.png

Geo-distribution is a key feature for CockroachDB

Also: Habana, the AI chip innovator, promises top performance and efficiency

Kimball noted that when they started the company, they weren’t yet sure where CockroachDB would fit into the ecosystem, or what kinds of companies would be willing and able to move to a new RDBMS. He went on to add, however, that in 2018 they began to answer those questions and ended with an impressive first year of revenue: 

“It turns out that much of the Fortune 2000 is struggling with often board-level mandates to embrace the benefits of the public cloud. That modernization process opens the door to consideration of alternatives to Oracle, especially databases better suited to exploiting the opportunities inherent in the cloud.

Where CockroachDB has a big strategic advantage over the likes of AWS Aurora or Google Cloud Spanner is that we offer a bridge from the reality of existing on-premise deployments to the desired outcome of using the public cloud wherever it makes sense. CockroachDB can be run on-premise, hybrid, and across arbitrary cloud vendors.”

Business, meet open source. Open source, meet the cloud.

This brings us to an interesting topic: Competition with cloud vendors, and the commons clause. This is something many open source vendors software are facing, as cloud vendors are taking their products and offering them as managed services, directly competing with said software vendors. In response, software vendors are modifying their licenses to prevent this. Kimball acknowledged this as a huge problem: 

“Just when everyone thought a stable business model had evolved for open source businesses, AWS perfected their strip mining operation. We don’t yet face the same set of conditions that Confluent does, both in terms of market adoption and direct competition from AWS with our core open source product. Additionally, stewardship of Kafka by the Apache Foundation introduces some of the complexity in Confluent’s licensing scheme.

Nonetheless, we must address the same root problem. We were one of the first companies to introduce a source-available enterprise license and to date, have been careful to apply it only to features which are useful chiefly to companies which really should be paying us.”

istock-485587762.jpg

Open source is an enabler for enterprise software, but the competition from cloud vendors is creating friction and causing license modifications


boygovideo, Getty Images/iStockphoto

Kimball also noted that the competitive behavior of AWS will put pressure on them to add features that traditionally would be pure open source to the “free” category of our enterprise license, and to add an exclusion for AWS-like behavior.

Business seems to be going well for Cockroach Labs, despite the competition. We would also add names such as NuoDB to that list, although NuoDB is not an open source offering. CockroachDB comes int three flavors: Core, Enterprise, and Managed. Features such as Geo-Partitioning, distributed backup and restore, and extra security are part of the Enterprise version. 

Also: Confluent shows open source, paradigm shifts, cloud, and commercial success can all co-exist

Names such as Comcast and Baidu are listed as CockroachDB users. Having a precedent with one of the BAT using open source Apache Flink, and eventually acquiring data Artisans, the vendor offering support for it, we were curious about Cockroach Labs’ relationship with Baidu. Kimball said Baidu turned to CockroachDB to replace its sharded MySQL with a distributed database that scales horizontally while providing the familiar SQL interface.

On being cloud native, Kubernetes, streaming, analytics, HTAP, and the future of CockroachDB

Inevitably, the discussion touched upon the impact of Kubernetes on databases as well. Cockroach Labs is a member of CNCF, and Kimball believes Kubernetes, more than any other CNCF project, illuminates a path to the desired outcome: 

“It can be run on-premise, and natively on all the cloud vendors, giving operations a consistent control plane across environments. This is why it’s one of the fastest growing open source projects of all time.

However, compared to Borg, the project within Google which inspired it, Kubernetes is still in its infancy. It’s struggled recently to handle stateful services. Additional tools will be necessary to orchestrate multiple Kubernetes clusters across regions or cloud providers. These capabilities are critical to enabling CockroachDB’s features such as geo-replication and geo-partitioning.”

kubernetes-part2-zoevandijk.jpg

CockroachDB has a big stake on Kubernetes, and on being cloud native with automatic automatic geo-scaling

While CockroachDB has a big stake in Kubernetes, what about features such as support for streaming and analytics? Is going HTAP something we can expect to see? Kimball said they are tackling transactional use cases first and foremost, but intend to build on that: 

“It’s a $45 billion market, so a juicy target. Providing a cloud-native, geo-distributed system of record is a position of strength from which we will expand into a formidable HTAP offering. This is a key difference from the strategy employed by Snappy Data or Splice Machine.

Those products have chosen to tackle HTAP as a distinct product category, whereas we believe strongly that the product category that matters is actually the OLTP system of record, and adding better analytics to that is a downhill journey.”

Kimball said they are in the process of vectorizing SQL execution and upgrading the underlying storage system to optimize for analytical workloads. In the meantime, distributed change data capture allows changes in the database to be transactionally streamed in real time to cloud storage or to Kafka. This allows CockroachDB to be paired with data warehousing or BI solutions. Native integration with Kafka is one way, but there’s a JDBC sink connector for Kafka that CockroachDB is compatible with out of the box.

Also: Start the reskilling revolution without me: Future of Work trends and soft data on soft skills

Kimball concluded by mentioning features such as full-text and geo-spatial indexes as likely future additions, and noted they routinely consider adding graph capabilities as well. In any case, CockroachDB is part of a growing class of databases that want to have it all, and is worth keeping an eye on.

Related stories:

Credit: ZDnet

Previous Post

PUBG using machine learning against cheating

Next Post

Learn Ethical Hacking with 180 Hours of Training — 2019 Course Bundle

Related Posts

DataStax Astra goes serverless | ZDNet
Big Data

DataStax Astra goes serverless | ZDNet

February 25, 2021
Off-chain reporting: Toward a new general purpose secure compute framework by Chainlink
Big Data

Off-chain reporting: Toward a new general purpose secure compute framework by Chainlink

February 25, 2021
Cutting-edge Katana Graph scores $28.5 million Series A Led by Intel Capital
Big Data

Cutting-edge Katana Graph scores $28.5 million Series A Led by Intel Capital

February 24, 2021
Hasura connects GraphQL to the REST of the world
Big Data

Hasura connects GraphQL to the REST of the world

February 23, 2021
As Power BI aces Gartner’s new Magic Quadrant, what’s the story behind Microsoft’s success?
Big Data

As Power BI aces Gartner’s new Magic Quadrant, what’s the story behind Microsoft’s success?

February 19, 2021
Next Post
Learn Ethical Hacking with 180 Hours of Training — 2019 Course Bundle

Learn Ethical Hacking with 180 Hours of Training — 2019 Course Bundle

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Recommended

Plasticity in Deep Learning: Dynamic Adaptations for AI Self-Driving Cars

Plasticity in Deep Learning: Dynamic Adaptations for AI Self-Driving Cars

January 6, 2019
Microsoft, Google Use Artificial Intelligence to Fight Hackers

Microsoft, Google Use Artificial Intelligence to Fight Hackers

January 6, 2019

Categories

  • Artificial Intelligence
  • Big Data
  • Blockchain
  • Crypto News
  • Data Science
  • Digital Marketing
  • Internet Privacy
  • Internet Security
  • Learn to Code
  • Machine Learning
  • Marketing Technology
  • Neural Networks
  • Technology Companies

Don't miss it

Attorney-General urged to produce facts on US law enforcement access to COVIDSafe
Internet Security

Attorney-General urged to produce facts on US law enforcement access to COVIDSafe

February 26, 2021
Machine Learning & Big Data Analytics Education Market: Soaring Demand Assures Motivated Revenue Share During 2020-2030 – KSU
Machine Learning

Machine Learning & Big Data Analytics Education Market: Soaring Demand Assures Motivated Revenue Share During 2020-2030 – KSU

February 26, 2021
This chart shows the connections between cybercrime groups
Internet Security

This chart shows the connections between cybercrime groups

February 26, 2021
Basic laws of physics spruce up machine learning
Machine Learning

New machine learning tool facilitates analysis of health information, clinical forecasting

February 26, 2021
Creative Destruction and Godlike Technology in the 21st Century | by Madhav Kunal
Neural Networks

Creative Destruction and Godlike Technology in the 21st Century | by Madhav Kunal

February 26, 2021
Spy agency: Artificial intelligence is already a vital part of our missions
Internet Security

Spy agency: Artificial intelligence is already a vital part of our missions

February 26, 2021
NikolaNews

NikolaNews.com is an online News Portal which aims to share news about blockchain, AI, Big Data, and Data Privacy and more!

What’s New Here?

  • Attorney-General urged to produce facts on US law enforcement access to COVIDSafe February 26, 2021
  • Machine Learning & Big Data Analytics Education Market: Soaring Demand Assures Motivated Revenue Share During 2020-2030 – KSU February 26, 2021
  • This chart shows the connections between cybercrime groups February 26, 2021
  • New machine learning tool facilitates analysis of health information, clinical forecasting February 26, 2021

Subscribe to get more!

© 2019 NikolaNews.com - Global Tech Updates

No Result
View All Result
  • AI Development
    • Artificial Intelligence
    • Machine Learning
    • Neural Networks
    • Learn to Code
  • Data
    • Blockchain
    • Big Data
    • Data Science
  • IT Security
    • Internet Privacy
    • Internet Security
  • Marketing
    • Digital Marketing
    • Marketing Technology
  • Technology Companies
  • Crypto News

© 2019 NikolaNews.com - Global Tech Updates