Themes

Philosophies and Methodologies for Knowledge Discovery

Copenhagen, Denmark:  24th August 2005

Rationale
Themes
Prog. Comm.
CALL
PROGRAMME
Dates & Fees
Publication

 

Keywords: metadata, meta-models, semantics, ontology, measurement theory, models, knowledge, meaning, epistemology, knowledge-discovery, creativity, perception, consciousness.

Many of the issues considered in the main DEXA and DEWAK conferences relate to sophisticated technical developments relating to particular problems, algorithms and applications of data mining and knowledge discovery techniques.  In contrast, this Workshop focuses on the general philosophical and methodological issues underlying the processes of knowledge discovery. 

All data mining algorithms are based on families of descriptive or predictive models which implicitly contain within them a view of the “real-world” (i.e. an Ontology).  Such views involve concepts of entities in that real world, their attribute sets, the relationships between them, and a measurement theory for how data is obtained.  The relationships between attributes, variables, and measurements stored in databases needs to be clearly and meaningfully described and captured in knowledge discovery. 

The necessary and logical relationships that exist between attributes of entities in the real world, and the measured attribute names, and relational structures embedded into a database of data from the real world embody the Semantics relating to metadata, another theme of the Workshop.  Explication of such world views of existence and frameworks for conversing about them, for data mining and knowledge discovery techniques, in relation to metadata and its semantics, is clearly important and is also one of the primary themes of the workshop. 

However, the real-world is extremely complex, as are the large and complex databases of data that are collected from them.  Models of the real-world calibrated/trained/fitted to empirical data are encapsulations of knowledge about the real world. However, such models will necessarily be simplifications or approximations to relationships and processes of reality, with the consequence that all data-derived world-view knowledge-model representations are only ever partial, and hence essentially erroneous to some extent.  Reality also seems to contain such heterogeneity that we ascribe “randomness/stochastisty” to many events and processes, and it is important to distinguish the differences between the deviations of data from the fitted/trained/calibrated model which may be ascribed to the model lack of fit to reality, and those deviations which arise from the essential stochasticity of the real world.  This stochasticity itself has a structure in the real world and obtaining knowledge models of this stochasticity structure is also an important feature of knowledge discovery. All of these considerations may be put under the general heading of ‘Theories of Knowledge and Meaning’, or Epistemology, another main theme of this Workshop. 

Notation and symbolism are powerful intellectual aspects of model formulation and representation which can have an influence on the interpretation of the meaning of discovered knowledge. This area, Semiotics, is therefore another topic for this workshop. 

Finally, knowledge discovery in large and complex databases cannot be limited to the training of a predetermined and static family of models/algorithms.  Machine knowledge search and discovery strategies need also to be modelled on the way in which humans search for and discover knowledge, but also within new frameworks and paradigms.  Hence Artificial Neural Nets, (ANNs), Genetic Algorithms, (Gas), and Intelligent Agents all may be regarded as dynamic adaptive heuristics for knowledge discovery.  It is important to consider questions of knowledge and meaning, discussed above, in relation to such machine-learning algorithms. 

All the philosophical issues mentioned above relate directly to issues of data mining and knowledge discovery.  However, it is clear that they are also the issues which underpin the major topics of human and machine perception, consciousness, self-consciousness, and even free-will  and creativity.  It might be the case that Workshop will find it necessary to consider these issues if it is to look at the future of knowledge discovery methodology. 

Preliminary Themes/Topics:

1.  Methodologies for Data-mining and Knowledge Discovery.
2.  Ontology (Reality)
  :  the “real” world, entities, relationships, metadata, XML, metamodels, semantics, measurement theory.
3.  Epistemology (Knowledge and Meaning) :
Models of reality and knowledge, Semiotics, Determinism, Stochasticity, Machine-learning, Interpretation.
4.  Machine-Learning and Artificial Intelligence : Adaptive search and discovery algorithms, perception, consciousness.