<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Case Based Reasoning for Knowledge Management in KDD-Projects</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Concepts</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Organizational Setting</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Categorization into KM</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Kai Bartlmae DaimlerChrysler AG, Research and Technology 3</institution>
          <addr-line>D-89013 Ulm</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Michael Riemenschneider DaimlerChrysler AG, Research and Technology 3</institution>
          <addr-line>D-89013 Ulm</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper we introduce our departments organizational and technical infrastructure for knowledge-intensive and weak-structured processes: A framework for Knowledge Management in the case of projects in Knowledge Discovery in Databases (KDD). It is based on the experience factory approach and the method of case based reasoning. We introduce both approaches in the context of knowledge management, derive application-areas and introduce our realization for projects in knowledge discovery in databases.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>Many knowledge intensive activities take place in project
organizations, where project teams form a temporal
organization, which are disbanded after the projects are
completed. This shows especially true for the work, our
department FT3/AD is involved in, Knowledge Discovery
in Databases. Here we analyze customer databases of
different DaimlerChrysler branches i.e. for marketing
reasons or for assessing credit risk. Because we work in
these temporal teams, it is our interest that the experience
gained in these projects should not only be kept as the
team members personal knowledge, but be kept within our
business organization in order to be reused.</p>
      <p>The copyright of this paper belongs to the paper’s authors. Permission to copy
without fee all or part of this material is granted provided that the copies are not
made or distributed for direct commercial advantage.</p>
      <sec id="sec-1-1">
        <title>Proc. of the Third Int. Conf. on Practical Aspects of</title>
      </sec>
      <sec id="sec-1-2">
        <title>Knowledge Management (PAKM2000)</title>
      </sec>
      <sec id="sec-1-3">
        <title>Basel, Switzerland, 30-31 Oct. 2000, (U. Reimer, ed.)</title>
        <p>http://sunsite.informatik.rwth-aachen.de/Publications/CEUR-WS/Vol-34/
This is not a new problem, so i.e. Heisig [HEI98b]
proposes that before a new project is being started, a plan
for collecting know-how and experiences should be
prepared, considering the topics of:
•
•
•
•
•
•</p>
        <p>Who is responsible for experience collection?
Where can know-how be gained?
Who gained certain experiences?
In what form should the experience be documented?
How are the experiences collected and saved?</p>
        <p>How are the experiences be disseminated?
But experience documentation has many barriers, so it is a
time intensive task and the person documenting it will in
many cases not be the user of it and therefore reluctant to
share [KPMG98]. Further, the project teams are under
time pressure and therefore the motivation for
documenting experiences is initially low. A goal of an
approach must be to give the team members help and time
when documenting their own project experiences as well
as giving them the information they need as easy and
quick as possible, releasing them from administrative
work. Further, project-management has to make the team
aware of the need for knowledge management, to define
processes for it, to train the teams, and last but not least to
introduce a technical infrastructure to collect, disseminate
and reuse them.</p>
        <p>Here we try to approach these problems with the concept
of case based reasoning together with the experience
factory concept by Basili et al. [BCR94], building the
base of the Experience Factory in Knowledge Discovery
in Databases at FT3/AD (see also [BAR99]). It covers
necessary aspects mentioned above for knowledge
management in project work. The approach has its basis in
the domain of software engineering and successfully be
applied by Althoff et al. [ABT97].
In this paper we introduce both concepts, show where they
can complete each other and how they cover the different
building blocks of knowledge management. The paper
concludes with the description of the KDD-experience
factory, describing selected experience package types
used.</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2 Organizational</title>
    </sec>
    <sec id="sec-3">
      <title>Factory approach view: The</title>
    </sec>
    <sec id="sec-4">
      <title>Experience</title>
      <p>The approach of the experience factory has been
introduced by Basili et al. as an evolutionary, experience
based approach for the improvement of software-products
and software-development-processes. They were
motivated by the realization, that collected experiences
can improve development processes[HOU99]. Based on
the Quality Improvement Paradigm (QIP), the experience
factory has been introduced as an organization that
supports the projects teams in the different steps of QIP.
One basic determinant of it is its organizational separation
from the project teams in order to compensate the
different goals project-teams and experience-management
have[ABT97]:
While project teams try to reach their project goals fast
and within a cost-frame, experience management wants
the avoidance of mistakes or the installation of good
practices using collected experiences. But this process of
experience collection is time consuming and costly,
meaning additionally effort for the project members. This
is why an organizational separation of collection and the
creation of experiences might prove useful. The
organization for collecting, structuring, saving and
disseminating of experiences is called an Experience
Factory by Basili. Experience packages (EP) are its form
for representing experiences of different structure and
types, from data to process definitions. These are saved in
an experience base, which can be compared to a
safeguarded organizational memory.</p>
      <p>The experience factory approach is in its basic form very
abstract and conceptional [HOU99]. In order to apply it, it
is necessary to define its specific goals, the tasks and
processes of the involved agents and to install a
(technological) platform.</p>
      <p>The experience factory approach has been applied in
different applications, here we modified the model in
order to apply it in the domain of Knowledge Discovery in
Databases (Figure 1).</p>
      <p>The approach proposed by Basili has been tailored
according to [ANT97]. We also distinguish the project
teams, conducting different KDD-projects, and the
experience factory organization, according to [HOU99],
with the roles of the Experience Engineer, the Experience
Factory Manager and EF-supporting-agents. See also
[BT98] for a similar differentiation.</p>
      <p>Experience
existing in
the world at
large</p>
      <p>Company/
Department</p>
      <p>Projectorganisation</p>
      <p>DM-Project n</p>
      <p>DM-Project 2
DM-Project 1
The building blocks according to Probst [PRR99] build a
general framework for Knowledge Management and is
based on a 2-layered learning cycle. The outside cycle
consists of the elements goals, realization and valuation
and describes a traditional management control cycle. The
inner cycle is represented by the blocks of knowledge
identification, knowledge acquisition, knowledge
development, knowledge distribution, knowledge use and
knowledge preservation. The blocks represent an overall
approach for management of knowledge in a business
organization. Since the conceptual approach by Probst et
al. incorporates the whole organization, the different
concrete knowledge managing initiatives have to be fitted
in this general approach. Here the Experience Factory
should represent a concretization of these blocks. We
therefore mapped the approach in order to investigate,
how the EF fulfilled these. Further, an approach should be
integratable, problem-oriented, understandable,
actionoriented and give instruments. Here we want to introduce,
how the experience factory and the used CBR-approach
realize these requirements.</p>
      <p>In the inner cycle, the blocks of knowledge use and
knowledge development is in the scope of the project
organizations and teams. On the other hand, knowledge
identification is one major task of the experience
engineers, but depends on the help of the project teams.
The EF-supporting agents are responsible for assisting the
project-teams and the knowledge distribution. This can
happen through joining the project teams, helping through
seminars in our so called KDD-Shop or last but not least
through our experience base called Core-DM (Case
Oriented Reuse of Experiences in Data Mining). Here our
department FT3/AD plays an experience factory-like role
for the different departments conducting knowledge
discovery in databases in corporation with us. We
represent a competence center in KDD, helping project
partners to conduct KDD. As a research department
within DaimlerChrysler, we are further interested in the
development and application of new KDD technologies.
We participate in KDD research and present the results to
leading academic conferences. But on the other hand, we
conduct knowledge acquisition through the buy in and
Outer Cycle
Inner Cycle
Knowledge Knowledge
Identification Aquisition
evaluation of products, through the cooperation with
universities and hiring of new personnel. At FT3/AD we
installed the KDD-Shop, where we evaluate new tools and
train our own teams and that of the project members in
order keep track with the state-of-the-art in our domain.
Our experience engineers are responsible for the
documentation of experience packages and artifacts of
projects. This is done in cooperation with the project team
and according to the EF-management's formulated
operative knowledge goals of what types should be
collected and how the infrastructure and processes should
look. They are therefore all the persons responsible for
knowledge preservation. Further, the EF-supportive
agents take part in the collection of information and
experiences within the project teams.</p>
      <p>In the outer cycle, the responsibility for setting knowledge
goals can be found on different levels. Probst et al.
differentiates between levels for formulating knowledge
goals, of interest are more the operative ones. Here
realistic goals have to be formulated and further, measures
to value these have to be defined and evaluated, closing
the loop with the formulation of optimized knowledge
goals.</p>
      <p>It can be seen, that the basic roles and responsibilities of
the Experience Factory can be assigned to the KM
building block approach in the context of our department
FT3/AD and Knowledge Discovery in Databases (see also
figure 2). Although the experience factory approach has
its focus on collection and reusing experiences in project
work, it also covers with its roles the basic blocks of an
general knowledge management approach. While the
Experience Factory is more of an organizational approach,
giving roles to the different persons, we now want to
introduce a more technical approach for completion.</p>
    </sec>
    <sec id="sec-5">
      <title>4 Cognitive</title>
    </sec>
    <sec id="sec-6">
      <title>Reasoning</title>
    </sec>
    <sec id="sec-7">
      <title>Sciences</title>
    </sec>
    <sec id="sec-8">
      <title>View:</title>
    </sec>
    <sec id="sec-9">
      <title>Case</title>
    </sec>
    <sec id="sec-10">
      <title>Based</title>
      <p>The approach of case based reasoning (CBR) and
knowledge management share the same goal: the use and
development of knowledge.</p>
      <p>While one can understand under knowledge management
a general and large area, incorporating different methods
Knowledge Knowledge
Development Distribution</p>
      <p>Knowledge Knowledge Knowledge Knowledge
Use Preservation Goals Valuation
Management
Project Organization
Figure 2: Assigning the responsibilities of the EF organization to the building blocks of Knowledge Management
of Probst et al.. (grey = important role, black = less important role).
2-3
and techniques, i.e. from organizational and technical,
case based reasoning represents a very concrete approach
for these mentioned goals.</p>
      <p>As we did this in the last section with the experience
factory, we will now introduce the case based reasoning
approach and show, how it can be used in the general
Probst framework and how the building blocks are
covered by CBR.</p>
      <p>The basic idea of case based reasoning is, that for solving
a new problem, a concrete similar but solved solution is
tailored to the new context and reused [WES96]. It is</p>
      <sec id="sec-10-1">
        <title>Experience Factory Organization</title>
      </sec>
      <sec id="sec-10-2">
        <title>Experience Factory Organization</title>
      </sec>
      <sec id="sec-10-3">
        <title>EF Support Experience EF</title>
      </sec>
      <sec id="sec-10-4">
        <title>Agent Engineer Manager</title>
        <p>Alow for
changes of
system</p>
        <p>CBR
SysCteBmR</p>
      </sec>
      <sec id="sec-10-5">
        <title>System</title>
        <p>Presentation
of cases,
experience
Save changed
domain model
Save changed
similarity
measures
Save case in
system</p>
      </sec>
      <sec id="sec-10-6">
        <title>Project</title>
      </sec>
      <sec id="sec-10-7">
        <title>Project</title>
      </sec>
      <sec id="sec-10-8">
        <title>Organization</title>
      </sec>
      <sec id="sec-10-9">
        <title>Organization</title>
        <p>Problem
description,
query to the
system
Adaptation
and reuse of
cases
New solution
Verify solution
and further
adaptation
Feedback to
the EF about
used cases</p>
      </sec>
      <sec id="sec-10-10">
        <title>Retrieve</title>
        <p>Support during
documentation
and selection</p>
      </sec>
      <sec id="sec-10-11">
        <title>Reuse</title>
      </sec>
      <sec id="sec-10-12">
        <title>Revise</title>
        <p>Confirmed
solution</p>
      </sec>
      <sec id="sec-10-13">
        <title>Retain</title>
        <p>New case</p>
        <p>Evaluation and
adaptation of</p>
        <p>cases,
experiences
Evaluation and
adaptation of
the system</p>
        <p>Final
evaluation of</p>
        <p>cases
Delete case</p>
        <p>Save case
Outer Cycle
Inner Cycle
2-4
based on a learning-cycle, including its phases retrieval,
reuse, revise and retain of cases and experiences [AP94].
It is based on cognition-psychology, stating that experts
tend to reuse concrete experiences rather than to solve
new problems from the ground up. Case based reasoning
tries to realize this idea by describing a problem and its
solution by a set of attributes and saves them as a case into
a case- or experience base. Besides this knowledge in the
experience-base in the form of cases, it is necessary to
formulate general knowledge on how to select, interpret
and transform cases, i.e. to formulate similarity measures
or how to transform the old solution into the new one.</p>
      </sec>
    </sec>
    <sec id="sec-11">
      <title>5 The Experience Factory as organizational framework for realizing a case based reasoning system</title>
      <p>Through the Experience Factory, a case based reasoning
system can be given a organizational framework
[ABT97]. With this framework, it is possible to
compensate the organizational deficits of the CBR
approach and assign responsibilities within the
CBRlearning cycle:
First of all, the EF-supporting agents, together with the
project-teams, are responsible for collecting cases that are
candidates for being saved into the experience-base. They
give these to the experience engineer for further
documentation. On the other side, they are responsible for
supporting the project teams by formulating queries to the
experience base and for retrieving old cases. They build
the interface between the project teams and the EF
organization.
the necessary knowledge goals of what is to be reached
with this approach and how its success can be measured.
The most important part of a CBR-system is the
experience base, where the cases are saved in the form of
experience packages. The experience packages are
accessed during the retrieve phase using a similarity based
measure. In most cases a technological platform exists, in
order to do this in an easy and fast way. In figure 3, the
lifecycle of a case can be seen along the CBR-phases and
the responsibilities according to the experience factory
organization.</p>
      <p>In figure 4 we mapped the CBR cycle onto the KM
building blocks. It can be seen that the CBR cycle by
[AP94] corresponds to the realization of the inner
knowledge management cycle according to Probst et al.
But also the design, evaluation and maintenance of a
CBR-system are important topics that need to be covered
by an overall approach. Here, we see the EF-manager and
the experience engineer responsible for the development
of the system, i.e. of the domain model, the structure, the
similarity measures and its technical implementation.
In figure 5 we assigned for each of the EF roles the
different CBR-phases and added the missing building
blocks. This combined framework of experience factory
and case based reasoning now covers all necessary steps
of a major knowledge management framework, making it
to a concretization of the introduced KM-building blocks.
Outer KM Cycle
Inner KM Cycle
CBR-System
Run-time
Retrieve</p>
      <p>Reuse</p>
      <p>Revise</p>
      <p>Retain</p>
      <p>CBR System Knowledge Knowledge
Desgin Goals Valuation
The experience engineer is responsible for the final
structure and form of the cases. He is a safeguard that the
quality of the cases are adequate. Further he has to
evaluate and perform maintenance operations on the
experience base and its cases. If necessary, he alters the
similarity measures for improved retrieval performance or
changes the case-structure. On the other hand, the
EFmanagement, together with the rest of the EF-team, sets</p>
    </sec>
    <sec id="sec-12">
      <title>6 Representation of KDD experience in a case based reasoning system</title>
      <p>In a case based reasoning system, knowledge is saved in
so called Knowledge Containers, which are case-base,
structure/vocabulary, similarity measures and transaction
knowledge [RIC98].
2-5
The development of a CBR system starts with the
structural description of the application domain. This
includes the kind of cases one wants to describe, their
structure and the definition of their attributes. Further, a
similarity measure has to be defined for retrieval from the
experience base. As a last step, knowledge on how to
transform an old solution to the current situation can be
included through transaction rules, but in our case the
transaction has to be performed by the user of the system
without technological help. The whole structural
description of a domain is called domain model and is
based on the following primitives [WES96]:</p>
      <p>Attribute and types, describe features of a
domain (i.e. Text, Reals, Integer)
Concepts, objects, describe concrete entities of
the domain
Relations describe the relationship between
objects
Rules, describe rule-based relations between
objects
Based on the structural description of the domain, a
similarity measure is defined in order to retrieve similar
cases from the case base. For each attribute of a given
case and a given query, a similarity can then be calculated,
which are aggregated to an overall similarity score
between a case and a query. The most similar cases can
then be presented to the user of the system. Using as
similarity measure makes it possible to find not only
completely fitting packages, but also "near-matches",
which is in the sense of CBR.</p>
      <p>We further used keywords to describe the packages
textual components. The keyword concept allows the
introduction of additional context description and assists
the user to identify useful packages.
Rather than relying on the experience engineer to find
good keywords, we combine our structural CBR approach
with a textual CBR technique (tCBR) for the
representation of the knowledge of the textual parts (See
also [BL00]). Here we rely on the structured form of the
cases and use the textual components to extract
Information Entities (IEs) about the packages. The
knowledge for identifying the IEs of the packages is given
by a set of term indices, thesauri, a product/name-index
and a term-generalization-index. The content of the
dictionaries is collected by our domain experts or
automatically by parsing KDD related documents. For
retrieving cases, we distinguish the attribute part, where
we can make use of the structured domain model's
predefined attributes and their possible values, and the
textual part, which makes use of domain-dependent and
common knowledge stored in the index-vocabulary,
thesauri and term-generalizations. For the textual parts, a
query to the experience base should give results similar to
a package, that contains similar expressions in the form of
the IEs. The resulting overall similarity is then calculated
as a weighted sum of the similarities of all attributes.
Before the experience base can be queried, the packages'
IEs have to be pre-calculated. This is done in an off-line
process.</p>
      <p>WWW-Client
(z.B. Netscape)</p>
      <p>Intranet
http
http
http</p>
      <p>CBR-Works
Case Navigator
CBR-Works
CQL-Server
CBROnline</p>
      <p>CBR-Works
Similarity
Measures</p>
      <p>Case</p>
      <p>Base
CQL-Query</p>
      <p>CQL-Results</p>
      <p>Servlet
Java Virtual Machine</p>
      <p>Webserver</p>
      <p>Fileaccess
Fileaccess</p>
      <p>IE-Generation
(PERL)
HTMLTemplate
Artefacts
2-6</p>
    </sec>
    <sec id="sec-13">
      <title>7 Technical View: Realization of a CBR</title>
      <p>based Experience Factory in the case of</p>
    </sec>
    <sec id="sec-14">
      <title>Knowledge Discovery in Databases</title>
      <p>At DaimlerChrysler, KDD is applied from different
KDDteams in projects from Credit-Scoring to Customer
Relationship Management. We see KDD as a
knowledgeintensive and weak-structured process, where the agents
have to choose on each step from a variety of options
based on their background-knowledge with KDD. Further
we observed, that because of the repetitive application of
a standard-process model in KDD, CRISP-DM1,
experience can be used in successive projects. This makes
organizational team support an important topic in the case
of KDD. Systematic knowledge creation, capture,
organization and use provides a way to support the
KDDprocess model CRISP-DM. We therefore identified types
of knowledge that can improve KDD-processes and ways
on how experience can be integrated using a CBR-based
experience factory for KDD, Core-DM.</p>
      <p>On the organizational side, we implemented the proposed
CBR based experience factory approach with its
processes. The technical architecture of the Core-DM
system can be seen in Figure 6 and is based on the
commercial tool CBR-Works from TecInno. We
implemented an intranet-interface using java-servlet
technology, which communicates with the
CBR-WorksServer using the CQL-case query language. Further, the
EF-teams use the CBR-Works Case-Navigator to author
the experience base. Since the user can access different
artifacts like KDD-reports, presentations or streams of our
Clementine Stream Library2, we further installed a simple
web-server.</p>
      <p>We derived nine types of experience packages to be
stored in the experience-base and disseminated through
the factory (Table 1). We use an object-oriented
packagemodel including generalizations so that common attributes
are shared by different package types. In the next section
we will introduce three of the nine package types in more
detail. These package types represent the different classes
of packages used in the experience base, from very
structured information packages (i.e. artifacts), to
semistructured packages using large textual component (i.e.
lessons learned) in order to represent the knowledge. So
far we collected over 350 packages by evaluating different
KDD-projects and our KDD-documents like guidelines
and handbooks.</p>
      <sec id="sec-14-1">
        <title>7.1 Lessons Learned-Packages</title>
        <p>Experience Packages of this type describe solutions
experienced in a concrete setting of a project (See figure
7). The packages are structured in a part for classification,
a main part of a solution-description, and a part giving
reasons for this solution (See also [HOU99]). For a first
classification of the package, attributes describing a
project3 and the KDD-step, where it occurred, are used.
Especially of interest is the step in CRISP-DM, where the
package has been used or has been created. This is being
modelled by a taxonomy of all possible process phases
and steps and indicates in new projects, where they can be
reused.</p>
        <p>A further context specification is saved additionally to
each package. The KDD- and application-context
description attributes help to characterize the context of
the packages. These features include information about
the overall goal of the KDD-Project (i.e. Prediction or
Description of data), the KDD-problemtype (i.e.
Regression, classification or segmentation of data), and
information about the application context. In this case, we
applied KDD in the area of marketing and
credit-riskmanagement and specify the concrete application within a
taxonomy of these areas. This context also includes
features about the objects being described by the data (i.e.
private customer information or small commercial
2 Clementine is a KDD Tool by SPSS Inc. used by
FT3/AD. Clementine programs are called streams.
3 Projects are described by its own package type not
further described here.
2-7
Domain model part</p>
        <p>Description
customers) and the regional setting of the application.
The content of the packages is further specified by two
attributes. A set of involved objects (i.e. Person, Time,
Data or Product) and the class of problem (taken from the
areas of management, technical problems, KDD-related
problems) help to differentiate the cases. On a
knowledgeperspective, three features represent the origin of the
package in respect to KDD and its processes (General
about KDD, from a KDD project, review of a project), the
specialization of the experience (General, special and
cookbook), the lifecycle (theory, observation and practice)
and the view onto the experience( i.e. Application
Developer, Business Analyst, KDD Engineer or End
User).</p>
        <p>An important characterization is the type of the case, here
we distinguish experience between Best Practice, How
To, Mistake/Critique and Success Factors.</p>
        <p>The experience is described in the main part of the
packages. This is being done in the two text fields, named
topic/problem and solution. So the case information has to
be processed in order to fit into these fields. Further, if it
is possible, the rationale for applying the solution and the
outcome after application can be collected in two further
text-fields. The introduced information entities (IEs) are
calculated over these four fields in order to use textual
CBR techniques. Figure 8 and 9 show, how the experience
base can be queried within our departments intranet.
Core-DM. The structural CBR approach allows for the
specification of attribute values, the textual approach
allows for keyword-search of the packages textual
components.</p>
      </sec>
      <sec id="sec-14-2">
        <title>7.2 Artifact-Packages</title>
        <p>In these packages artifacts of different KDD-processes
and projects are collected for reuse. These artifacts can be
2-8
of different types, i.e. presentations, reports of projects or
code-fragments. The user of the system can, therefore,
specify the type of artifact he wants to retrieve. This
information is represented by an artifact-type in the
experience base. Further, the KDD- and
applicationcontext specification is saved as before in addition to each
artifact-package, and last but not least the CRISP-DM
process step.</p>
        <p>The artifacts are further described by a short abstract, a
detailed description and the project it has been created in.
The content of the artifact is characterized by an attribute
using a taxonomy of content-types. Here one distinguishes
broadly between the result of the KDD-projects like
deliverables, reports, process supporting documentation
(user-guides or reference-models) for a certain application
area. Last but not least, a reference to the concrete artifact
is used, so that it can be downloaded from our web-server.</p>
      </sec>
      <sec id="sec-14-3">
        <title>7.3 Person-Packages</title>
        <p>With this concept the information and especially skills of
persons involved in our KDD-projects can be described.
The description can be separated into two parts. First,
information about the person is being saved as in any
person-register, from names to addresses and phone
numbers.</p>
        <p>In the second part, the skills and roles of a person are
described, making it possible to find persons according to
their knowledge and expertise and who are willing to
share these with others. Rather than using free-text fields
to describe these, the packages domain model gives
predefined attributes in order to characterize the person.
Here the package distinguishes between the
KDDapplication (i.e. credit scoring for new customers) a
person is involved in, its regional setting, the
KDDmethods and techniques (i.e. regression techniques) he is
expert in. Of further interest in the context of KDD are
programming language or product skills, given by a fixed
taxonomy.</p>
        <p>In order to substitute our departments personal register we
also collect traditional individual and person information
in free text fields.</p>
      </sec>
    </sec>
    <sec id="sec-15">
      <title>8 Conclusion</title>
      <p>In this paper we introduced our approach for managing
experiences in KDD-projects. It is based on the
experience factory organization and the approach of case
based reasoning. We therefore investigated how the CBR
based experience factory approach covers the different
aspects of knowledge management, represented by the
approach of Probst et. al. It showed that case based
reasoning and the experience factory approach
complement each other on the technical and
organizational level for our needs. We then introduced our
realization of the approach in the domain of knowledge
discovery in databases. We described our solution for the
experience base, called Core-DM, which is based on a
combination of structural and textual CBR techniques.
In the next steps we plan to evaluate the system Core-DM.
We will derive quantitative and qualitative measures in
order to value aspects like quality of the experience-base,
economic utility, usability and technical performance.
These can then be aggregated to measure the overall
success of our knowledge management initiative. A
further topic of interest is tightly coupled to this
evaluation step. So the maintenance-step of the
experience-base and its packages has to be investigated.</p>
    </sec>
    <sec id="sec-16">
      <title>References</title>
      <p>[ABT97] Althoff, K.-D./ Birk, A./ Tautz, C.: The
Experience Factory Approach: Realizing Learning
from Experience in Software Development
Organizations. Proceedings of the Tenth German
Workshop on Machine Learning, University of
Karlsruhe, 6-8- August, 1997. IESE-Report No.
013.97/E. 1997.</p>
      <p>http://www.iese.fhg.de/pdf_files/iese-013_97.pdf
[ANT98] Althoff, K.-D./ Nick, M./ Tautz, C.: CBR-PEB:
Implementing Reuse Concepts of the Experience
Factory for the Transfer of CBR System Know-How.
Preceeding of the 7th Workshop on Case-Based
Reasoning. IESE-Report No. 058.98/E. 1998.</p>
      <p>http://www.iese.fhg.de/pdf_files/iese-058_98.pdf
[AP94] Aamodt, A/ Plaza, E.: Case-based reasoning:
Foundational issues, methodological variations, and
system approaches. AI-Communications, 7(1), 39-59,
1994
[BT98] Birk, A./ Tautz, C.:Knowledge Management of
Software Engineering Lessons Learned. Technical
Report. IESE-Report No. 002.98/E. 1998.</p>
      <p>http://www.iese.fhg.de/pdf_files/iese-002_98.pdf
2-9
(Database and Expertsystems Application Conference),
London, Lecture Notes in Computer Science,
SpringerVerlag, New York, 2000</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [BCR94]
          <string-name>
            <surname>Basili</surname>
            ,
            <given-names>V.R.</given-names>
          </string-name>
          / Caldiera, G./ Rombach, H.D.:
          <article-title>Experience Factory</article-title>
          . In: J. Marciniak, editor,
          <source>Encyclopedia of Software Engineering</source>
          , vol.
          <volume>1</volume>
          , John Wiley and Sons,
          <year>1994</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [BAR99]
          <string-name>
            <surname>Bartlmae</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>A CBR based Experience Factory for Data Mining</article-title>
          ,
          <source>in: Proceedings of the International Computer Science Conference: Internet Applications (ICSC'99), Lecture Notes in Computer Science</source>
          , Springer-Verlag, New York,
          <year>1999</year>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [BL00]
          <string-name>
            <surname>Bartlmae</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          / Lanquillon,
          <string-name>
            <given-names>C : A KDD</given-names>
            <surname>Experience</surname>
          </string-name>
          <article-title>Factory: Using Textual CBR for Reusing Lessons Learned</article-title>
          ,
          <source>in: Proceedings of the DEXA2000</source>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [HEI98b]
          <string-name>
            <surname>Heisig</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          : Projektmanagement Wissensmanagement.
          <article-title>Wissenstransfer noch Thema</article-title>
          . In: IT Management 7/
          <year>1998</year>
          . In german.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [HOU99]
          <string-name>
            <surname>Houdek</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Empirisch basierte Qualitätsverbesserung</article-title>
          .
          <source>Systematischer Einsatz externer Experimente im Software Engineering. Dissertation</source>
          . Logos-Verlag. Berlin.
          <year>1999</year>
          . In german.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <article-title>[KPMG98] KMPG Management Consulting</article-title>
          , Parlby,
          <string-name>
            <surname>D.</surname>
          </string-name>
          : Knowledge Management.
          <source>Research Report</source>
          <year>1998</year>
          . http://www.kpmg.co.uk/kpmg/uk/services/manage/rese arch/knowmgmt/knowmgmt.pdf
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [PRR99]
          <string-name>
            <surname>Probst</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          / Raub, S./ Romhardt,
          <string-name>
            <given-names>K. Wissen</given-names>
            <surname>Managen</surname>
          </string-name>
          .
          <article-title>Wie Unternehmen ihre wertvollste Ressource optimal nutzen</article-title>
          .
          <source>Gabler. Wiesbaden</source>
          .
          <year>1999</year>
          . In german.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [REI98]
          <string-name>
            <surname>Reinartz</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          et. al.:
          <article-title>The Current CRISP-DM Process Model for Data Mining</article-title>
          .
          <source>In: Maschinelles Lernen. S. 1-9</source>
          .
          <year>1998</year>
          . In german.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [RIC98]
          <string-name>
            <surname>Richter</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <string-name>
            <surname>Introduction</surname>
            (to CBR) In: Lenz,
            <given-names>M.</given-names>
          </string-name>
          /
          <string-name>
            <surname>Bartsch-Spörl</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          / Burkhard, H-D./ Wess, S.:
          <source>Case Based Reasoning Technology. From Foundations to Applications. Lecture Notes in Artificial Intelligence</source>
          . Springer Verlag, New York.
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [WES96]
          <article-title>Wess, S: Fallbasiertes Problemlösen in wissensbasierten Systemen zur Entscheidungsunterstützung und Diagnostik: Grundlagen, Systeme und Anwendungen</article-title>
          .
          <source>Dissertation. Dissertationen zur künstlichen Intelligenz. Bd. 126. Infix Verlag. Sankt Augustin</source>
          .
          <year>1995</year>
          . In german.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>