•   
  •   
  •   

Technology Trifacta goes all in on the cloud

18:10  06 april  2021
18:10  06 april  2021 Source:   zdnet.com

With a $10 billion cloud-computing deal snarled in court, the Pentagon may move forward without it

  With a $10 billion cloud-computing deal snarled in court, the Pentagon may move forward without it The Joint Enterprise Defense Infrastructure contract, known as JEDI, has been bogged down in litigation for years. The Pentagon is finding ways to move on without it. The weight of two long-running lawsuits has the Defense Department openly questioning whether JEDI will still be worth the trouble if the court seeks to depose former president Donald Trump and other former officials. And the success of some of the Pentagon’s other, quieter cloud initiatives is lending new weight to the idea that JEDI’s one-cloud-to-rule-them-all strategy may not be necessary in the first place.

diagram © ZDNet

Trifacta, which has become the last pure play data prep tools provider still standing, sees its future as a broader based cloud software-as-a-service (SaaS) service. This week, it is unveiling a new Data Engineering Cloud that will deliver a fully managed service on each of the major clouds. That will be in addition to, not instead of Wrangler, its long-established on-premises prep suite.

Top Cloud Providers

a screen shot of a person: Top cloud providers: AWS, Microsoft Azure, and Google Cloud, hybrid, SaaS players © Provided by ZDNet Top cloud providers: AWS, Microsoft Azure, and Google Cloud, hybrid, SaaS players

Top cloud providers: AWS, Microsoft Azure, and Google Cloud, hybrid, SaaS players

Here's a look at how the cloud leaders stack up, the hybrid market, and the SaaS players that run your company as well as their latest strategic moves.

Meghan Markle's Friends and Colleagues React to Bullying Claims

  Meghan Markle's Friends and Colleagues React to Bullying Claims Meghan Markle's Friends and Colleagues React to Bullying Claims

Read More

Trifacta's niche will continue to be serving as the front end design studio where the data engineer, data scientist, or business developer creates the "recipes" for data preparation and transformation. The Trifacta Data Engineering Cloud will extend beyond data prep to encompass cleansing, validation, profiling, and the monitoring of data pipelines. But those pipelines will run in the downstream execution tool of choice. The Trifacta Data Engineering Cloud service won't replace the Databricks or Snowflakes of the world, but instead let users run data prep inside them. And, as for Databricks, Trifacta is also announcing today that it is taking the partnership up a notch with native integration of its data prep pipelines into the Lakehouse platform that is built around Delta Lake.

Push-down query capabilities: Five questions to ask your cloud BI provider

  Push-down query capabilities: Five questions to ask your cloud BI provider How do organizations balance the need for a hybrid environment with all the benefits they can get from an elastic, modern cloud-based BI platform? The answer is: a push-down query capability. Top cloud providers: AWS, Microsoft Azure, and Google Cloud, hybrid, SaaS players

In the run-up to the announcement, Trifacta has had a good dress rehearsal for the SaaS service as the OEM partner behind Google Cloud Dataprep. The GCP offering put the Trifacta suite on a cloud-native platform running on Kubernetes (K8s), and while it was initially focused on ELT working with Google BigQuery and cloud storage, it recently added a premium tier that added support for non-Google data sources such as Oracle, SQL Server, MySQL, PostgreSQL, and salesforce.com. The premium edition serves as a prelude to the new Trifacta Data Engineering Cloud offering, which also takes advantage of the microservices and K8s architecture of the Google offering to provide the cookie cutter template for rollout to other clouds.

Beyond multi-cloud support, the Trifacta offering broadens beyond the no-code, drag and drop tool for business analyst to provide multiple pathways for designing data preparation. It now offers three views. It includes the original "grid" view, that provided the spreadsheet view for data preparation tasks, where values were reconciled to the right columns. Then it adds a flow view, which shows the entity relationships familiar to SQL developers, and the "code" view that is suited for Python programmers. While SQL developers can use DBT (Data Building tool) for writing transformations using SQL Select statements, data scientists can write transforms in Python from their Jupyter notebooks; the results populate Trifacta recipes that are handed down to execution environments. A rich library of 180+ connectors are also provided. Once the recipes are created, they can be integrated into the data pipelines or workflows of external tools or services, such as Databricks, through APIs.

Oracle aims to take the headache out of cloud migration

  Oracle aims to take the headache out of cloud migration The company is rolling out a new service that it says offers a single point of contact for technical delivery and removes barriers for adoption so customers can move away from data centers.Cloud Lift is designed to accelerate migrations and act as a "seamless path to the cloud," said Oracle Cloud Infrastructure SVP Vinay Kumar. In essence, it "provides a single point of contact for all technical delivery and removes critical expertise barriers for adoption of Oracle Cloud Infrastructure (OCI) services," Oracle said.

When Trifacta emerged roughly a decade ago, data preparation was targeted at data lakes, viewed as a rough-cut alternative to traditional ETL tools, typically using a spreadsheet-like interface where rudimentary machine learning capabilities would suggest columns names, spot specific types of data patterns such as street address, names, or personally-identifiable data such as account numbers, and then suggest which columns could be consolidated and modest corrections to make data more correct or uniform.

These capabilities eventually became commodity, and as such, ended up getting incorporated into ETL suites, data science tools, data catalogs, and so on. Unlike the old days of enterprise data warehousing, where IT or database developers handled data transformation, data preparation became a broad-based responsibility as end users, from business analysts to data scientists, clamored for self-service. Instead of forcing these folks into different tools, data prep grew ubiquitous in their existing workspaces and tools of choice.

Low-code workflow automation service now supports multicloud and hybrid cloud deployments

  Low-code workflow automation service now supports multicloud and hybrid cloud deployments Boomi aims to improve choice, flexibility and portability by increasing development and deployment options for citizen developers.Flow enables the creation of custom workflows and applications to take advantage of the benefits of multicloud deployment agility while keeping the Flow runtime and associated data within private environments.

Also: What is low-code and no-code? A guide to development platforms

Not surprisingly, most of Trifacta's pure play rivals have either disappeared or been acquired, among them, Paxata by Data Robot less than a year and a half ago. At this point, Alteryx, which also positions itself as an "analytics process automation" workbench for citizen data scientists, remains Trifacta's best-known rival.

Not surprisingly, with core data prep functions commoditized, the new Trifacta offering goes beyond that with predictive transformation that autodetects data formats and structures and infers transformation logic; "adaptive" data quality that statistically profiles data to identify complex patterns and suggest transformation rules; and "smart" data pipelines that model data flows. While data integration, data science, and analytic tools cover data prep, Trifacta is positioning its Data Engineering Cloud as a more deluxe service.

With the new cloud service, not surprisingly, Trifacta is rolling out consumption-based pricing, providing a contrast to the traditional licensing of its Wrangler on-premises suite. It's an expected route for SaaS providers, and for Trifacta, is intended to open up its addressable market beyond large enterprises that start with six-figure investments with tiers that start with free trials and starter subscriptions at $80/month.

How the quick shift to the cloud has led to more security risks

  How the quick shift to the cloud has led to more security risks Automating cloud security is a process still in its infancy for many organizations, says Unit 42.SEE: Managing the multicloud (ZDNet/TechRepublic special feature) | Download the free PDF version (TechRepublic)

The service, not surprisingly, is patterned off and expands on the OEM service that Trifacta has delivered with Google for the past three years. There will be feature parity across AWS and Azure, in addition to GCP. Nonetheless, GCP will remain first among equals as a jointly supported and sold OEM offering natively integrated to BigQuery.

Trifacta's challenge is akin to that of third party databases or analytic tools that are not the captive of a specific cloud provider, analytics tool, or data science workspace. It's the classic choice between umbrella platform vs. best of breed, and single cloud vs. multi-cloud. For Trifacta, it is enterprises whose data assets and analytic platforms are heterogenous and likely to remain so. With APIs, Trifacta aims to embed its data engineering services into the workflows of whatever runtimes that business analysts, data engineers, or data scientists are using. Thanks to its three years running an OEM service on Google Cloud, Trifacta is not entering the world of SaaS as a rookie.

Big Data

  • IOTA still wants to build a better blockchain, and get it right this time
  • Where is Snowflake going?
  • Oracle brings Autonomous Data Warehouse to the rest of us
  • Don’t delay, fix your data now for when quantum computing is fully ready (ZDNet YouTube)
  • Data literacy gap among young people could impact businesses (TechRepublic)

Pods and elastic engineering: Rackspace adds new services to support cloud operations .
The new capability can be accessed via a fractional subscription and supports all the major cloud providers.The company reports that customers will work with a dedicated pod of nine architects and engineers and use the services by subscribing to a fractional capacity from a pod.

usr: 0
This is interesting!