Leveraging Databricks Pipelines for Machine Learning Model Deployment and Monitoring
DOI:
https://doi.org/10.32628/CSEIT241061256Keywords:
Databricks, Machine Learning, Cloud Computing, Model DeploymentAbstract
This article explores the use of Databricks pipelines for deploying and monitoring machine learning models efficiently. Databricks provides a scalable and collaborative environment that integrates seamlessly with MLflow for tracking and managing the machine learning lifecycle. The study examines the various stages of machine learning pipeline development, including automated data preprocessing, model training, hyperparameter optimization, and deployment. Additionally, it highlights the significance of continuous monitoring through Databricks’ built-in metrics and logging features, ensuring real-time performance tracking and model drift detection. By leveraging cloud-based platforms such as Azure and AWS, along with powerful tools like Delta Lake and Databricks MLflow, organizations can enhance the reliability and scalability of their ML workflows.
Downloads
References
MLOps Definition and Benefits https://www.databricks.com/glossary/mlops
AI and machine learning on Databricks https://docs.databricks.com/aws/en/machine-learning/
Delta Live Tables Databricks Documentation https://docs.databricks.com/aws/en/delta-live-tables/
Best Practices for Data Engineering with Databricks Part 1, accessed April 1, 2025, https://www.persistent.com/blogs/best-practices-for-data-engineering-with-databricks-part-one/
MLOps workflows https://docs.databricks.com/aws/en/machine-learning/mlops/mlops-workflow
Aritra Ghosh, How to orchestrate MLOps by using Azure Databricks? https://www.linkedin.com/pulse/how-orchestrate-mlops-using-azure-databricks-aritra-ghosh/
Sachin Dixit, & Jagdish Jangid. (2024). Asynchronous SCIM Profile for Security Event Tokens. Journal of Computational Analysis and Applications (JoCAAA), 33(06), 1357–1371. Retrieved from https://eudoxuspress.com/index.php/pub/article/view/1935
Satyanarayana Murthy Polisetty, Santhosh Kumar Pendyala, Sushil Prabhu Prabhakaran. (2023). Enhancing Data Quality in Streaming Data Pipelines Using Delta Live Tables. International Journalof Research in Computer Applications and Information Technology (IJRCAIT), 6(1), 87-96.
Pendyala,SK, “Cloud-Driven Data Engineering: Multi-Layered Architecture for Semantic Interoperability in Healthcare” Journal of Business Intelligence and Data Analytics., 2023, vol. 1, no. 1, pp. 1–14. doi: https://10.55124/jbid.v1i1.244
Sushil Prabhu Prabhakaran, Satyanarayana Murthy Polisetty, Santhosh Kumar Pendyala. Building a Unified and Scalable Data Ecosystem: AI-Driven Solution Architecture for Cloud Data Analytics. International Journal of Computer Engineering and Technology (IJCET), 13(3), 2022, pp. 137-153. https://iaeme.com/Home/issue/IJCET?Volume=13&Issue=3
Satyanarayana Murthy Polisetty, Santhosh Kumar Pendyala, Sushil Prabhu Prabhakaran. Strengthening Data Integrity and Security via Cloud Administration and Access Control Strategies. International Journal of Computer Engineering and Technology (IJCET), 14(3), 2023, 283-297. https://iaeme.com/MasterAdmin/Journal_uploads/IJCET/VOLUME_14_ISSUE_3/IJCET_14_03_027.pdf
Santhosh Kumar Pendyala, Satyanarayana Murthy Polisetty, Sushil Prabhu Prabhakaran. Advancing Healthcare Interoperability Through Cloud-Based Data Analytics: Implementing FHIR Solutions on AWS. International Journal of Research in Computer Applications and Information Technology (IJRCAIT), 5(1), 2022, pp. 13-20. https://iaeme.com/Home/issue/IJRCAIT?Volume=5&Issue=1
Sushil Prabhu Prabhakaran, Satyanarayana Murthy Polisetty,Santhosh Kumar Pendyala. Building a Unified and Scalable Data Ecosystem: AI-DrivenSolution Architecture for Cloud Data Analytics. International Journal of Computer Engineering and Technology (IJCET), 13(3), 2022, pp. 137-153. https://iaeme.com/Home/issue/IJCET?Volume=13&Issue=3
Malhotra, S., Yashu, F., & Malviya, A. (2024). Serverless mesh architectures for multi-cloud and edge. International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 10(1), 326–329. https://doi.org/10.32628/CSEIT2425446
(PDF) Building a Unified and Scalable Data Ecosystem: AI-Driven Solution Architecture for Cloud- DataAnalytics.https://www.researchgate.net/publication/389906454_Building_a_Unified_and_Scalable_Data_Ecosystem_AI-Driven_Solution_Architecture_for_Cloud_Data_Analytics
Satyanarayana Murthy Polisetty, Santhosh Kumar Pendyala, Sushil Prabhu Prabhakaran. (2024). Cloud-Native Lakehouses: Multi-Cloud Strategies for Business Intelligence and Data Analytics. International Journal of Research in Computer Applications and Information Technology (IJRCAIT), 7(1), 74-93. https://iaeme.com/MasterAdmin/Journal_uploads/IJRCAIT/VOLUME_7_ISSUE_1/IJRCAIT_07_01_009.pdf
Downloads
Published
Issue
Section
License
Copyright (c) 2024 International Journal of Scientific Research in Computer Science, Engineering and Information Technology

This work is licensed under a Creative Commons Attribution 4.0 International License.