Capability matrix
The following tables summarize the key differences in capabilities among Azure Data Factory, SQL Server Integration Services (SSIS), and Oozie on HDInsight.
General capabilities
| Capability | Azure Data Factory | SQL Server Integration Services (SSIS) | Oozie on HDInsight |
|---|---|---|---|
| Managed | Yes | No | Yes |
| Cloud-based | Yes | No (on-premises) | Yes |
| Prerequisite | Azure subscription | SQL Server | Azure subscription, HDInsight cluster |
| Management tools | Azure portal, PowerShell, CLI, .NET SDK | SSMS, PowerShell | Bash shell, Oozie REST API, Oozie web UI |
| Pricing | Pay per usage | Licensing / pay for features | No additional charge on top of running the HDInsight cluster |
Pipeline capabilities
| Capability | Azure Data Factory | SQL Server Integration Services (SSIS) | Oozie on HDInsight |
|---|---|---|---|
| Copy data | Yes | Yes | Yes |
| Custom transformations | Yes | Yes | Yes (MapReduce, Pig, and Hive jobs) |
| Azure Machine Learning scoring | Yes | Yes (with scripting) | No |
| HDInsight On-Demand | Yes | No | No |
| Azure Batch | Yes | No | No |
| Pig, Hive, MapReduce | Yes | No | Yes |
| Spark | Yes | No | No |
| Execute SSIS Package | Yes | Yes | No |
| Control flow | Yes | Yes | Yes |
| Access on-premises data | Yes | Yes | No |
Scalability capabilities
| Capability | Azure Data Factory | SQL Server Integration Services (SSIS) | Oozie on HDInsight |
|---|---|---|---|
| Scale up | Yes | No | No |
| Scale out | Yes | No | Yes (by adding worker nodes to cluster) |
| Optimized for big data | Yes | No | Yes |