Abstract

Praedictio is a business predictions framework that provides powerful predictive analytics by analysing the business data scattered across different repositories in an organization. The Praedictio framework enables developers and data scientists to integrate data-driven ML models into business applications quickly and easily, with powerful tools for aggregating data, modeling data, and training and deploying models. Praedictio can run on-premise or on any cloud platform and serve highly accurate business predictions that enable business owners and decision makers to make timely decisions about their business.

Introduction

Machine learning is growing in popularity across a wide spectrum of business domains, driven by the need for customer-focused, accurate, and robust business insights. Creating and maintaining a platform for reliably producing and deploying machine learning models requires careful orchestration of many components: a learner for generating models based on training data, modules for analyzing and validating both data and models, and finally infrastructure for serving models in production. One of the biggest challenges in creating and maintaining a machine-learning-based prediction system is orchestrating model creation, learning, model validation, deployment, and infrastructure maintenance in a production environment. With highly volatile data and continually improving learning models, deploying fresh models becomes trickier. Most machine learning frameworks and systems address only model training or only deployment, and connectivity between different components is done ad hoc via glue code or custom scripts. Praedictio integrates the aforementioned components into one platform, simplifying platform configuration and reducing time to production while increasing scalability.

Praedictio introduces a modular architecture to simplify model development and deployment across frameworks and applications. Furthermore, by introducing caching, batching, and adaptive model selection techniques, Praedictio reduces prediction latency and improves prediction throughput, accuracy, and robustness without modifying the underlying machine learning frameworks. The platform can also be integrated with enterprise systems while satisfying stringent data security, privacy, or regulatory requirements.

Platform Design and Anatomy

The Praedictio design adopts the following principles:

One machine learning platform for many learning tasks. There is a large and growing number of machine learning frameworks. Each framework has strengths and weaknesses, and many are optimized for specific models or application domains (e.g., computer vision). Thus, there is no dominant framework, and often multiple frameworks may be used for a single application. When training data grows, a need arises for a framework with distributed training, which can force a move away from a framework once selected as the best available. Common model exchange formats have been introduced in the past, but they did not gain popularity because of rapid technological advances and the additional errors arising from parallel implementations for training and serving. We chose to use TensorFlow and Scikit Learn as the trainer, but the platform design is not limited to these specific libraries. One factor in choosing (or dismissing) a machine learning platform is its coverage of existing algorithms. Scikit holds a wide variety of pre-implemented ML algorithms, and TensorFlow provides full flexibility for implementing any type of model architecture.

Continuous training. Most machine learning pipelines execute their components sequentially, so every component must be re-executed as the data to be fed grows. This becomes a bottleneck, since most real-world use cases require continuous training. Praedictio supports several continuation strategies that result from the interaction between data visitation and warm-starting options.

Easy-to-use configuration and tools. Providing an admin and configuration framework is only possible if components also share utilities that allow them to communicate and share assets. A Praedictio user is only exposed to one admin panel to manage all components.

Production-level reliability and scalability. Only a small fraction of a machine learning platform is the actual code implementing the training algorithm. If the platform handles and encapsulates the complexity of machine learning deployment, engineers and scientists have more time to focus on the modeling tasks.

Product Road Map

The Praedictio platform's road map has been envisioned to deliver the core components in an iterative manner.

MVP – Prediction Serving System and API gateway
Alpha – Training pipeline, Admin panel
Beta – Action Engine

Architecture and Overview

Figure 1 shows a high-level component overview and architecture of the machine learning platform and highlights the components discussed in the following sections.
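
The Introduction states that Praedictio uses caching to reduce prediction latency without modifying the underlying machine learning frameworks. The following is a minimal sketch of that general idea, not Praedictio's actual implementation: a wrapper that places a small least-recently-used cache in front of any prediction function. The names `CachedPredictor`, `toy_model`, and `max_entries` are hypothetical and chosen only for illustration; `toy_model` stands in for a real TensorFlow or scikit-learn model call.

```python
from collections import OrderedDict
from typing import Callable, Hashable, Sequence, Tuple


class CachedPredictor:
    """Hypothetical helper: wraps a predict function with a small LRU cache."""

    def __init__(self, predict_fn: Callable[[Tuple], float], max_entries: int = 1024):
        self._predict_fn = predict_fn    # framework-specific call, left untouched
        self._max_entries = max_entries  # assumed cache size limit
        self._cache = OrderedDict()      # maps feature tuple -> cached prediction
        self.hits = 0
        self.misses = 0

    def predict(self, features: Sequence[Hashable]) -> float:
        key = tuple(features)
        if key in self._cache:
            self.hits += 1
            self._cache.move_to_end(key)     # mark as most recently used
            return self._cache[key]
        self.misses += 1
        value = self._predict_fn(key)        # fall through to the real model
        self._cache[key] = value
        if len(self._cache) > self._max_entries:
            self._cache.popitem(last=False)  # evict the least recently used entry
        return value


def toy_model(features):
    # Stand-in for a real model: returns the mean of the features.
    return sum(features) / len(features)


predictor = CachedPredictor(toy_model, max_entries=2)
first = predictor.predict([1.0, 2.0, 3.0])   # computed by the model
second = predictor.predict([1.0, 2.0, 3.0])  # served from the cache
```

Because the cache sits in front of the prediction function rather than inside it, the same wrapper works regardless of which framework produced the model. A production cache would additionally need invalidation when a fresh model is deployed, which this sketch omits.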