- DataMigration.AI
- Posts
- Cirata Unveils DMaaS for Streamlined Hadoop to Cloud Migrations
Cirata Unveils DMaaS for Streamlined Hadoop to Cloud Migrations
With DMaaS, businesses can easily migrate Hadoop and other data environments to the cloud, significantly reducing migration timeframes and minimizing operational disruptions.
Welcome to the DataMigration.AI Newsletter: Transforming Data Migration with Generative AI Data migration can be complex, but at DataMigration.AI, a brand of Towards AGI, we’re making it smarter, faster, and more reliable than ever. Leveraging the power of Generative AI, our expert team and innovative technology work in tandem to tackle the intricacies of data transfer with precision and care.
Have you explored deploying AI Agents for your Data Migration initiatives?
Explore how our Managed Services for AI Agents can help you unlock the full potential of AI without the operational burden. As AI transforms industries by streamlining processes and enhancing decision-making, managing AI agents at scale demands continuous oversight, tuning, and robust security. With our end-to-end support, you can seamlessly deploy, monitor, and optimize AI agents aligned with your business goals, ensuring reliable, secure, and impactful performance. Let us handle the complexities, so you can focus on strategic growth and innovation. Dive in to discover how we can empower your AI initiatives!
Cirata Unveils DMaaS for Streamlined Hadoop to Cloud Migrations
Cirata, a company specializing in automating Hadoop data transfers to modern cloud analytics and AI platforms, has introduced its new Data Migration as a Service (DMaaS). This service aims to simplify large-scale cloud migrations, making data transfers more efficient for enterprises with complex data requirements. With DMaaS, businesses can easily migrate Hadoop and other data environments to the cloud, significantly reducing migration timeframes and minimizing operational disruptions.
"Cirata DMaaS empowers companies to leverage AI and analytics on cloud platforms without the technical burdens associated with extensive data migrations," said Danny Keating, Senior Director of Business Development at Cirata. "Our service eliminates the typical challenges of migration, allowing companies to focus on core objectives while we manage data transfers securely, smoothly, and at exceptional speed."
DMaaS is supported by Cirata’s Data Migrator tool, which automates the migration of data from the Hadoop Distributed File System (HDFS), Hive metadata, and other cloud data sources to leading cloud platforms, including Amazon S3, Azure Data Lake Storage Gen 2, Google Cloud Storage, and Oracle Object Store. This solution enables seamless data movement without modifications to existing applications or disruptions to production systems, ensuring data integrity and uninterrupted business operations.
Key features of Cirata DMaaS include:
- Automated, Scalable Migrations: Designed to handle migrations of any size, even in environments with active data changes, Cirata’s Data Migrator enables smooth transitions with no system downtime or business impact.
- Quick Deployment and Integration: With an average deployment time of only two weeks, Cirata’s DMaaS can expedite data migration, often completing in less than three months.
- Expert Guidance and Support: Cirata’s Data Integration Specialists offer personalized support from initial assessments through full implementation, providing documentation and expert advice to ensure a seamless migration.
- Tailored Onboarding Process: The Cirata team remotely evaluates each use case, coordinates deployment plans, and aligns migration objectives to meet each customer’s specific needs.
AWS Sunsets Snowcone and Legacy Snowball Devices in Data Migration Shift
Amazon Web Services (AWS) has announced it will discontinue its Snowcone data migration devices and phase out all but the latest versions of its Snowball Edge appliances. For those unfamiliar, the Snowcone was a compact, ruggedized network-attached storage (NAS) device, measuring 9x6x3 inches, designed to help customers who faced challenges in migrating large datasets to the cloud. Customers would load data onto the device and ship it back to Amazon, where it would then be transferred to a specified S3 bucket.
Officially discontinued on Tuesday, Snowcone devices and their associated documentation have been removed from the AWS website. Historical records show that Snowcone was available in versions with either an 8TB hard drive or a 14TB SSD.
Existing Snowcone users currently migrating data don’t need to worry about an abrupt stop; AWS has stated it will support existing customers until the same time next year. However, AWS urges users to complete their transfers and return the devices before they become unusable.
Additionally, AWS announced it is discontinuing three earlier models of the Snowball Edge appliance, which were versatile, suitcase-sized devices used for storage, migration, and edge computing. These models include the Snowball Edge Storage Optimized (80GB), Edge Compute Optimized with 52 vCPUs, and Compute Optimized with GPU.
AWS will provide a one-year grace period for customers using these older devices before requiring their return. Unlike Snowcone, though, the Snowball Edge series isn’t being entirely phased out. AWS will continue to offer the latest models, with the storage-optimized version now featuring up to 210TB of NVMe storage (with a 100TB option) and the compute-optimized version boasting 104 vCPUs. However, customers in need of a GPU-equipped model are out of options.
The announcement follows Amazon’s earlier decision, six months ago, to retire its Snowmobile fleet—massive trucks that transported petabytes of data on physical storage to and from the cloud.
Amazon’s rationale for discontinuing these physical data migration solutions is straightforward: most customers now prefer online migrations. AWS suggests using alternatives like DataSync or AWS Direct Connect for network-based migrations, and for edge computing needs, it recommends its standard 1U and 2U on-prem Outpost systems as more practical options compared to Snowball’s unique design.
Delhivery Shifts 500TB of Data to India in Compliance with DPDP Act
Delhivery, one of India’s leading third-party logistics providers, recently completed a significant data migration project, moving over 500 TB of essential data from AWS’s US East (Northern Virginia) Region to the Asia Pacific (Mumbai) Region, according to an AWS blog. The migration was reportedly carried out to align with India’s Digital Personal Data Protection (DPDP) Act, which stipulates that organizations store critical data within Indian borders.
However, it's worth noting that the current DPDP Act does not explicitly require data localization within India, though earlier versions of the legislation did emphasize the need for storing critical data domestically.
How Delhivery Executed the Data Migration
With a network covering over 18,000 pin codes across India, Delhivery has rapidly scaled its operations, relying on a robust data lake to support its business and analytics functions. The company successfully migrated more than 70 million data objects to AWS’s India-based regions.
Delhivery collaborated with Amazon Web Services (AWS) to facilitate this large-scale migration to the Asia Pacific (Mumbai) Region, utilizing Amazon S3 Cross-Region Replication (CRR) and S3 Batch Operations. These tools enabled Delhivery to replicate data in near real-time and transfer historical records.
The migration was completed in two phases:
S3 Cross-Region Replication: Delhivery used real-time replication for newly created data, ensuring an instant backup in India and eliminating potential data loss.
S3 Batch Operations: For historical data stored in the U.S., Delhivery transferred it in batches over several weeks to the Indian repository.
Understanding AWS Regions and Their Functionality
AWS Regions refer to specific geographic areas where AWS data centers operate globally. Each region includes multiple isolated Availability Zones designed for redundancy and reliability. By deploying across multiple zones, users can improve application resilience and fault tolerance.
To handle this extensive migration, Delhivery leveraged AWS’s tools, including Amazon S3 CRR and S3 Batch Operations, ensuring minimal operational disruption. Delhivery’s infrastructure, comprising over 800 data pipelines, processes 60,000 messages per second and ingests 350 GB of data daily.
Concerns Around the DPDP Act, 2023
The DPDP Act has raised several issues, particularly regarding the government’s broad powers to exempt itself from the law. The Act allows the collection of publicly available personal data and grants the government authority to block content. It also imposes additional compliance requirements on data-processing companies while leaving gaps in data breach protocols. Additionally, the Act could potentially weaken the RTI Act’s effectiveness.
Thank you for joining us in this edition of the DataMigration.AI Newsletter! We’re excited to continue sharing how Generative AI is transforming data migration, making it smarter, faster, and more resilient. Our commitment to combining expert support, innovative technology, and customized solutions ensures that your data migration journey is smooth and aligned with your business goals.
As we move forward, expect more insights into our evolving methodologies, success stories, and best practices. At DataMigration.AI, we’re here to empower your organization to unlock the true value of your data with confidence and precision. Stay tuned for our next issue as we continue redefining what’s possible in data migration, ensuring your projects are not only successful today but also future-ready.
Contact us for any collaborations and sponsorships.