Academic
Publications
A distributed architecture for data mining and integration

A distributed architecture for data mining and integration,10.1145/1552280.1552282,Malcolm P. Atkinson,Jano I. van Hemert,Liangxiu Han,Ally Hume,Chee

A distributed architecture for data mining and integration   (Citations: 6)
BibTex | RIS | RefWorks Download
This paper presents the rationale for a new architecture to support a significant increase in the scale of data integration and data mining. It proposes the composition into one framework of (1) data mining and (2) data access and integration. We name the combined activity DMI. It supports enactment of DMI processes across heterogeneous and distributed data resources and data mining services. It posits that a useful division can be made between the facilities established to support the definition of DMI processes and the computational infrastructure provided to enact DMI processes. Communication between those two divisions is restricted to requests submitted to gateway services in a canonical DMI language. Larger-scale processes are enabled by incremental refinement of DMI-process definitions often by recomposition of lower-level definitions. Autonomous evolution of data resources and services is supported by types and descriptions which will support detection of inconsistencies and semi-automatic insertion of adaptations. These architectural ideas are being evaluated in a feasibility study that involves an application scenario and representatives of the community.
Published in 2009.
Cumulative Annual
View Publication
The following links allow you to view full publications. These links are maintained by other sources not affiliated with Microsoft Academic Search.
    • ...The execution of the workflow is fully controlled by the user and monitored by a specialized monitoring Web based application integrated to the ADMIRE Portal [4]...
    • ...The tool is implemented as a part of the ADMIRE Workbench [4], which represents the client side of the distributed environment...
    • ...Figures [3], [4] and [5] show screenshots of the...

    Ivan Janciaket al. Visualization of the mining models on a data mining and integration pl...

    • ...As described in our previous work [4], if a PE reads all of its input tuples, and requires multiple accesses to each tuple (referred as aggregative behaviour), it requires the entire data to be held in memory, or repeatedly re-read from disk...
    • ...A preliminary version of the ADMIRE architecture was reported in [4]...

    Chee Sun Liewet al. Towards optimising distributed data streaming graphs using parallel st...

    • ...Third, there are new computing patterns being introduced, including those for on-demand computing [2, 14], distributed computing [25], and integrating computing with data mining [5, 18]...

    Robert L. Grossmanet al. An overview of the Open Science Data Cloud

    • ...The design of the architecture is user-centric, and is driven by three types of users [4]: 1) domain experts who are users with an acute understanding of the concepts and logic underlying an application domain (for instance, financial analysts and investors); 2) algorithm experts who specialise in the algorithms and techniques for carrying out computational tasks identified by the domain experts (for instance, data mining and integration ...
    • ...This is done using DISPEL (a Data-Intensive Systems Process Engineering Language [4]), which abstracts workflow patterns as functions...

    Gagarine Yaikhomet al. Federated Enactment of Workflow Patterns

    • ...Research for Europe (ADMIRE) [3] is pioneering architectures and models that will deliver a coherent, extensible and flexible framework to accelerate access to high volumes of scientific data and business data and increase the benefits from data exploitation...
    • ...The proposed automatic optimizations are going to be integrated in the ADMIRE infrastructure gateway service [3]...
    • ...The necessary descriptions are stored in the ADMIRE registry service [3]...

    Alexander Wöhreret al. Logical Optimization of Dataflows for Data Mining and Integration Proc...

Sort by: