Capability matrix
The following tables summarize the key differences in capabilities between Azure Data Factory, SQL Server Integration Services (SSIS), and Oozie on HDInsight.
General capabilities
Capability | Azure Data Factory | SQL Server Integration Services (SSIS) | Oozie on HDInsight |
---|---|---|---|
Managed | Yes | No | Yes |
Cloud-based | Yes | No (local) | Yes |
Prerequisite | Azure subscription | SQL Server | Azure subscription, HDInsight cluster |
Management tools | Azure portal, PowerShell, CLI, .NET SDK | SSMS, PowerShell | Bash shell, Oozie REST API, Oozie web UI |
Pricing | Pay per usage | Licensing / pay for features | No additional charge on top of running the HDInsight cluster |
Pipeline capabilities
Capability | Azure Data Factory | SQL Server Integration Services (SSIS) | Oozie on HDInsight |
---|---|---|---|
Copy data | Yes | Yes | Yes |
Custom transformations | Yes | Yes | Yes (MapReduce, Pig, and Hive jobs) |
Azure Machine Learning scoring | Yes | Yes (with scripting) | No |
HDInsight On-Demand | Yes | No | No |
Azure Batch | Yes | No | No |
Pig, Hive, MapReduce | Yes | No | Yes |
Spark | Yes | No | No |
Execute SSIS Package | Yes | Yes | No |
Control flow | Yes | Yes | Yes |
Access on-premises data | Yes | Yes | No |
Scalability capabilities
Capability | Azure Data Factory | SQL Server Integration Services (SSIS) | Oozie on HDInsight |
---|---|---|---|
Scale up | Yes | No | No |
Scale out | Yes | No | Yes (by adding worker nodes to cluster) |
Optimized for big data | Yes | No | Yes |
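
As a rough illustration of how the pipeline capabilities table can be used to shortlist a tool, the sketch below transcribes that table into a Python dictionary and filters on a set of required capabilities. The short names `ADF`, `SSIS`, and `Oozie`, the dictionary `PIPELINE_CAPABILITIES`, and the helper `tools_supporting` are hypothetical and exist only for this example. Qualified entries such as "Yes (with scripting)" are simplified to a plain yes.

```python
# Pipeline capabilities transcribed from the table above.
# True = "Yes" (including qualified entries such as "Yes (with scripting)"),
# False = "No". The keys "ADF", "SSIS", and "Oozie" are shorthand chosen
# for this sketch, not official identifiers.
PIPELINE_CAPABILITIES = {
    "Copy data":                      {"ADF": True, "SSIS": True,  "Oozie": True},
    "Custom transformations":         {"ADF": True, "SSIS": True,  "Oozie": True},
    "Azure Machine Learning scoring": {"ADF": True, "SSIS": True,  "Oozie": False},
    "HDInsight On-Demand":            {"ADF": True, "SSIS": False, "Oozie": False},
    "Azure Batch":                    {"ADF": True, "SSIS": False, "Oozie": False},
    "Pig, Hive, MapReduce":           {"ADF": True, "SSIS": False, "Oozie": True},
    "Spark":                          {"ADF": True, "SSIS": False, "Oozie": False},
    "Execute SSIS Package":           {"ADF": True, "SSIS": True,  "Oozie": False},
    "Control flow":                   {"ADF": True, "SSIS": True,  "Oozie": True},
    "Access on-premises data":        {"ADF": True, "SSIS": True,  "Oozie": False},
}

TOOLS = ["ADF", "SSIS", "Oozie"]


def tools_supporting(required):
    """Return the tools that support every capability named in `required`."""
    return [
        tool
        for tool in TOOLS
        if all(PIPELINE_CAPABILITIES[capability][tool] for capability in required)
    ]


if __name__ == "__main__":
    print(tools_supporting(["Pig, Hive, MapReduce", "Control flow"]))
    print(tools_supporting(["Execute SSIS Package", "Access on-premises data"]))
```

A lookup like this only reflects the yes/no entries in the table; the parenthetical qualifiers, pricing, and prerequisites still need to be weighed separately.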