Phillip Kingston is a Visiting Professor at State University Kyiv Aviation Institute, Kyiv, Ukraine and a Member of Technical Staff at AppliedAI in Abu Dhabi, United Arab Emirates.
Phillip Kingston's recent research is focused on advancing AI-driven workflow automation in complex business environments. As evidenced by two key papers, his work addresses the end-to-end generation of complex workflows, from formalizing the intent behind a process to optimizing its execution, with an emphasis on incorporating domain knowledge and multimodal data. The following are the main technical areas of his contributions:Workflow Automation and Optimization
- AI-Driven Workflow Generation: Kingston develops frameworks for automatically generating workflows in complex Business Process Outsourcing (BPO) scenarios, aiming to reduce costs and improve quality. These workflows model business processes as sequences of tasks that transform a given input into a desired output while respecting operational constraints.
- Task Decomposition with DAGs: He represents workflows as Directed Acyclic Graphs (DAGs), where each node is a task and edges define execution flow. Each task consists of executable instructions (e.g. code, tool usage) and may include human expert review steps to ensure quality. This task-based decomposition provides a structured blueprint of complex processes.
- Two-Phase Generation & Optimization: Kingston's approach uses a two-phase methodology for workflow creation. In the generation phase, a Large Work Model (LWM) – a specialized AI model – produces candidate workflows from a high-level intention, leveraging domain knowledge. In the optimization phase, the candidate workflows are converted into a Workflow Graph, and path-finding algorithms then select an optimal workflow (from input to output) via path optimization techniques. This ensures the chosen workflow is both feasible and cost-effective.
Formalization of Workflow Intention and Business Artefacts
- Defining Workflow Signal and Intention: Kingston introduces formal concepts to capture the goal of a process. Workflow Intention is defined as the alignment of the Input, Process, and Output elements that describe a workflow's objective. This intention is interpreted from “Workflow Signals” embedded in business artefacts (documents, data) – essentially, clues in the artefacts that indicate what input-process-output triple is needed.
- Mathematical Representation: He provides a mathematical framework for these concepts, representing a Workflow Signal as a vector and a Workflow Intention as a tensor. By formalizing the intention as a tensor (a structured data object), the workflow's components and their relationships can be encoded in a machine-interpretable way. This rigorous representation helps ensure that AI models can accurately capture the structure and requirements of a workflow.
- Modular and Structured Workflow Modeling: Kingston's framework is implemented via a modular, trainable attention-based system that can ingest business artefacts and infer the workflow intention. The architecture is designed in separable modules (e.g. per data modality, see next section) for scalability. Furthermore, it leverages structured knowledge sources like a Work Knowledge Graph (WKG) to incorporate domain-specific procedural knowledge into the workflow model. By combining formal intention modeling with knowledge graphs, his approach grounds workflow generation in both data-driven signals and established business rules.
Multimodal AI and Attention-Based Processing
- Transformer Models for Diverse Data: Kingston's work leverages transformer architectures (with encoder-decoder designs) to extract information from both structured and unstructured data sources, such as text, images, and documents. For example, textual data is tokenized and processed with attention mechanisms to capture semantics, while image data is processed via OCR and convolutional encoders to extract visual features. These encoded representations are brought into a common high-dimensional space, enabling the system to handle multimodal inputs (e.g. mixing text and image content) in a unified manner.
- Attention-Based Multimodal Generation: He has developed an attention-driven multimodal generative system that reads business artefacts (e.g. forms, documents, images) and infers the underlying workflow intention. In this system, modality-specific encoders first handle each input type (text, image, etc.), and then intra-modality attention and fusion attention mechanisms combine information across modalities. An Intention Decoder network uses these attended features to produce the workflow intention (the triple of Input–Process–Output) that best explains the artefacts. This end-to-end design allows the AI to interpret complex business documents and derive the corresponding process they imply.
- Vision Transformers & Multimodal Integration: Kingston also incorporates state-of-the-art vision transformer (ViT) techniques and multimodal learning paradigms to integrate visual and textual information. Vision transformers treat image patches like tokens and apply self-attention, capturing both local and global image features without traditional convolution. His framework leverages such architectures to process image-based artefacts (like scanned documents) alongside text. Moreover, he draws on advanced multimodal transformers (e.g. models that fuse text and vision embeddings) to ensure the AI can reason across modalities. These approaches serve as the backbone for ingesting diverse Business Artefacts and constructing a contextual understanding from them, which is crucial for generating accurate workflows.
Graph-Based Knowledge Integration
- Work Knowledge Graph (WKG): A central element of Kingston's research is the use of knowledge graphs to inject domain and procedural knowledge into the workflow generation process. He helps construct a Work Knowledge Graph, which encodes known workflows, tasks, and their relationships gathered from various industries. This graph acts as a knowledge base of how different tasks connect and what processes are valid, capturing business rules and best practices in a structured form.
- Graph Neural Networks for Retrieval: To utilize the WKG during workflow generation, Kingston employs Graph Attention Networks (GATs) and similar graph neural network techniques. For instance, one approach involves training a GAT over the WKG so that it can suggest related workflow nodes (tasks) that should be included in a partially generated workflow. The GAT allows the system to retrieve relevant graph nodes by having tasks “attend” to neighbors' features, effectively finding procedural steps in the knowledge graph that align with the current workflow context. This integration ensures that the AI-generated workflow is informed by known valid processes.
- Retrieval-Augmented Workflow Generation: His methodology can be seen as a form of retrieval-augmented generation, where the AI first queries the knowledge graph for pertinent information and then uses that to guide workflow creation. Concretely, the system compares the encoded workflow intention to nodes in the WKG (using embedding similarity measures) and selects a subset of relevant nodes from the graph. Those retrieved graph elements (tasks or sub-workflows) are provided to the LWM model as additional context, so the model generates a workflow that is consistent with domain knowledge. By augmenting the generative model with knowledge graph retrieval, Kingston's approach increases the accuracy and compliance of the resulting workflows with real-world procedures.
Optimization Techniques for AI-Driven Workflows
- Cost Modeling for Workflows: To choose the best workflow among many possibilities, Kingston integrates a cost modeling approach into workflow optimization. Each candidate workflow (or path in the workflow graph) is evaluated with a cost function that accounts for multiple factors: computational expense, processing time, and any model/tool usage costs. He uses a weighted linear combination of these factors to assign a total cost to a given workflow path. By quantitatively modeling resource and time constraints, the system can objectively compare workflows not just on feasibility, but on efficiency.
- Graph Search and RL for Optimal Paths: For selecting the optimal execution path through the Workflow Graph, Kingston applies classic graph search algorithms and explores learning-based methods. In practice, his implementation uses a modified Dijkstra's algorithm to find the least-cost path from the input node to the output node, adapting the algorithm to consider node-associated costs (tasks costs) instead of just edge weights. Additionally, his research acknowledges and considers reinforcement learning (RL) strategies for workflow optimization, especially for complex or dynamic scenarios. While the current system iteration emphasizes graph algorithms due to the complexity of the action space in workflows, RL is examined as a potential approach for agents to learn optimal task sequencing in richly-defined environments.
- Evaluation and Benchmarking: Kingston places importance on evaluating AI-generated workflows against real-world standards. He has helped develop benchmarks in an industry context – for example, testing their Opus system on a medical coding workflow and demonstrating that the specialized models significantly outperform general Large Language Models (LLMs) (by 29–38% in workflow generation accuracy for that use case). Beyond accuracy, his work advocates for comprehensive metrics that include efficiency and compliance. In other words, a workflow is assessed not only on correctness, but also on execution time, resource consumption, and adherence to business rules. This holistic evaluation framework ensures that the AI-driven workflows are practical and reliable for deployment in enterprise settings, bridging the gap between AI-generated plans and real operational constraints.
Overall, Phillip Kingston's research contributions center on bridging AI with operational business processes – formalizing the concept of a workflow intention, harnessing multimodal data through advanced neural architectures, injecting knowledge via graphs, and optimizing the results for real-world efficiency. His work provides a pathway to transform high-level process requirements into executable, optimized workflows that maintain compliance with domain knowledge and performance criteria.
Papers:
- Opus – A Workflow Intention Framework for Complex Workflow Generation
- Opus – A Large Work Model for Complex Workflow Generation