DOI QR코드

DOI QR Code

A Process-Centered Knowledge Model for Analysis of Technology Innovation Procedures

  • Chun, Seungsu (Division of Knowledge & Information, Korea Institute of S & T Evaluation and Planning)
  • Received : 2015.10.21
  • Accepted : 2016.02.18
  • Published : 2016.03.31

Abstract

Now, there are prodigiously expanding worldwide economic networks in the information society, which require their social structural changes through technology innovations. This paper so tries to formally define a process-centered knowledge model to be used to analyze policy-making procedures on technology innovations. The eventual goal of the proposed knowledge model is to apply itself to analyze a topic network based upon composite keywords from a document written in a natural language format during the technology innovation procedures. Knowledge model is created to topic network that compositing driven keyword through text mining from natural language in document. And we show that the way of analyzing knowledge model and automatically generating feature keyword and relation properties into topic networks.

Keywords

1. Introduction

The growing codification of knowledge and its transmission through communications and computer networks has led to the emerging information society and innovation systems. The knowledge management and analysis has been recognized important issue and to be most essential process on technology innovation and using resource in knowledge based economic environment. But how to organize and inter-operation of information and knowledge integration is to solve complex and very difficult problems[1].

This study is intended as integrating knowledge resources and identification of business activities prior to implementation of knowledge model, focused on verification and optimization of knowledge model for supporting technology innovation. Some change in environment that knowledge based industries and economic development can produce a variety of complexity to the problem. Therefore, business organization for innovation is needed more rapid decision making and strategic analysis. In particular, rapid change of external environment and increase of complex systems require investment strategies and more intelligent decision making process and utilization system[2].

The existing research on knowledge management and analysis is re restricted to syntactic search and classification of data and documents, or the static relationship analysis between data[3][4]. However, as the development has recently been accelerated in the areas of the natural and formal language processing and semantic technology including artificial intelligence, it has be made possible to design a variety of knowledge models and analyze semantically[5][6]. Fig. 1 reflects to design knowledge model of supporting innovation and validating structured process models. And, i have been derived the topic keyword of the process model from task information, generated hierarchical structure model by relation between topic keyword. This topic mapping for process models to model the knowledge network configuration and it is converted into the equivalent relationship.

Fig. 1.Sequential framework for development of unified knowledge model

This is a topic of unstructured information to extract meaning from the process model is mapped to the abstract model is transformed to the equivalent relationship network. This analysis and generation of knowledge model makes possible the design of knowledge maps and analysis of large scale data in the process. Thus, procedure of semantic analysis and relation between processes base on path of tracking evidence of decision making. As shown in figure1, it is to express the logical structure for creating of new knowledge model.

 

2. Process based knowledge model

Knowledge model in this study is conceptually defined as classification and expression of the relationships between knowledge and concepts in knowledge management. It could also map the meaning of knowledge such as typology, property, and pattern. It features the contexts and features of topics, relations between topics, computations, restrictions, which allows topic-based knowledge model to conduct semantic expression and automatic inference and in turn help understand effectively tacit knowledge through knowledge analysis and utilize knowledge[6]. Traditional business model was designed using the BPML(Business Process Modeling Notation), Activity Diagram, UML EDOC, IDEF, ebXML, BPSS, ADF(Activity Decision Flow), RossettaNet, LOVeM, EPCs(Event Process Chains), Petri Nets etc. In study be in a language that was utilized kripke structure[7][8][9][10]. In this study, I was divided and pre-defined a type for classification of knowledge and information on process of technology innovation. In recent research for business process design and validation, and have been investigated actively standardized graphical notation. Graphical notation for the specification of the model is easy to extend[11][12]. This context, semantic analysis based on knowledge model could be viewed as “the processes to understand semantic structures and relations of knowledge” and must consider dynamic changes and continuous expansion of knowledge in designing knowledge model. A resource element of integrated knowledge model is a constituent that maps representative features of information resources like data and documents into a unit of topic and even a piece of data or document could produce various topic relations[13][14].

In this paper, as shown in Fig. 2 and Fig. 3, on business activities in technology innovation was defined as abstracted process model. Here we denoted variables of the process Model(M) and Process(P) and Relations(R), Label(L). The relationship between the process(p) on abstracted process model(M) are denoted through transition and condition, interaction, reference etc. A process(p) has element(E) and element(E) has proposition(X). Each process(P) in the model is defined as follows : each transition(p, p') and has relation(R). Kripke structure’s process model is defined as follows.

M : Process Model = , P : Process = Set of finite process(p), P0 : P0⊆P : Set of initial process, R : (p,p') ∈R⊆P×P =Process transition(∀p∈P, ∃p∈P), X : True | False = Set of AP(Atomic Proposition), L : P → 2X = assigns to each process the set of atomic propositions that are true in that process. P0 : Business Foresight={Trend t, Road Map m}, P1 : Business Planning={Strategy s, Invest i} P2 : Performance Management = {Research r, Development d}, P3 : Measurement Evaluation = {Outcome o, Evaluation e}

Fig. 2.Kripke based abstract process model

Fig. 3.Business process driven topics based knowledge model

Heterogeneous model checking in this study is easy to use for conceptual learning and analysis of knowledge model. And model checking to satisfy in a given model, the path of the property violated the state are presented. In this way, knowledge model to analyze can be continuously refining. The model properties are expressed in CTL(Computation Tree Logic)[9][10]. CTL according to the passage of time representation of the property and it is easy to define a specific scope in model. CTL’s syntax and semantics of temporal quarter is defined by the BNF(Backus Normal Form) as follow.

As existing in CTL, ‘A’ is universal operator and ‘G’ is global operator. And ‘W’ is until operator. Intuitively ‘AG Φ’ is always true in all paths(true) means and ‘A(Φ1W Φ2) means that Φ1 is forever true until Φ2 to be true for all paths in model.

Assuming a finite model M and CTL formula is true in the process. P0 is written as p0 ⊨Φ.

A transition(p, p') in the model(M) indicated reachable from current process(Pn) to next process(Pn+1). Abstract process model(M) has the set of the transition(R). Transition relation (p0,p1)∈R and by a single transition from p0 to p1 is reachable[15].

On reaching it from current process’s forward and backward as the successor the predecessor said. As above, the current process(Pn) can be reached through a transition from one of the hollowing conditions to obtain the ‘image computation’ is called a calculation or reachability analysis.

In the model, as the process reached the bottom of the ‘image computation’ algorithm and the same calculation when to stop is called fix-point. And the ‘reachable’ meaning in CTL formula is defined as follows.

R = {(p0, p1), (p1, p2), (p1, p3), (p2, p3), (p3, p2), (p3, p3), (p3, p0)}, successor(P{}, R) = {p'|∃p∙p∈P∧(p,p')∈R} = pre∀(P), predecessor(P{}, R) = {p|∃p'∙p'∈P∧(p,p')∈R} = post∃(P)

The Symbol μ is operator of least fix-point, and ν is operator of greatest fix-point.

Find(AG(true), P) = R true(P) , Find(A(true ∨ AG(¬Φ)) W Φ), P) = R¬Φ(P))

Find(AG(Φ ⇒ AG true), P) = Rtrue(|[Φ]| ∩ post∃(R¬Φ(P)),

Find(AG(Φ1∧¬Φ2⇒A|[true∨AG(¬Φ2)WΦ2]|), P) = R¬Φ2(|[Φ1∧¬Φ2]|∩Rtrue(P))

Find(AG(true), P) = R true(P), Init Process = {p0}, P{} = {p0} Until(Stepi+1{} = Stepi{}), Fixed Point = RΦ = {p0, p1, p2, p3}, R{p0, p1, p2, p3} = Topic = (t∧m) ∨ (s∧i)∨ (r∧d) ∨ (o∧e)

In figure2, the process based Knowledge model is a tuple , where X is a finite set of placesX0 ⊆ T is the set of initial places.

R ⊆ X × X is the transition relation of places which is assumed to be total:

∀x∈X • ∃s’∈X • (x,x’)∈R.

That is, for every place x∈X, there exists a successor x’∈X with (x,x’)∈R.

T is a finite set of atomic propositions(called semantic topic),

L: T → 2T assigns to each place the set of atomic propositions that are true in that place.

For example, a model is given in Figure 3 and used throughout this paper. In this case, X1 = {x1} and R = {(x2,x1), (x2,x3), (x3,x1), (x4,x5), (x6,x5), (x9,x10), (x7,x10), (x12,x13), (x16,x17), (x14,x15)}. (x,x’)∈R means that s moves to x’ in a step. That is, x’ is an image of x. Given the subset T of places, the set of images of T is image(T) = {x’ | ∃x • x∈T ∧ (x,x’)∈R}. image({x0}) = {x1,x2};

That is, from x0 we can move to both x1 and x2 in one step. A path is an infinite sequence of places in which each consecutive pair of places belongs to R. A place is reachable if it appears on some path starting from some initial place. In example, x16 is reachable since there is a path x0, x1, x2, x5, x6, x7, x8, x9, x11, x12, x15, x16 which leads to that place, while x10 is unreachable since there is no path leads to x10, x13, x14. Every reachable place is computed by successively applying the function image from initial places X0 until the fixed point is reached. The following procedure is to find out every reachable place:

P0 := X0 repeat Pi+1 := Pi ∪ image(Pi) until (Pi+1 = Pi) When Pi+1 = Pi, it stops.

Thus, Pi is called the fixed point, which includes every reachable place. For convenience, let Q = Pi denote the set of reachable places. Q = {x0,x1,x2,x5,x6,x7,x8,x9,x11, x12,x15,x16}. Only x10,x13,x14 does not belong to Q, since it is unreachable from the initial place x0.

Syntax of propositional logic is as follows: ϕ ::= t | ¬ϕ | ϕ1 ∧ ϕ2

As usual, t∈T denote atomic proposition, ¬ is the negation operator, ∧ is the conjunction operator. We write x |= ϕ, if a place x satisfies propositional formula ϕ.

In other words, x|=ϕ means that ϕ is true at a place x. The satisfy relation between x, ϕ is inductively defined as follows:

x |= t iff t∈L(x), x |= ¬ϕ iff x |≠ ϕ, x |= ϕ1 ∧ ϕ2 iff x |= ϕ1 and x |= ϕ2

Theorem 1: Assume Y is the subset of atomic propositions true at x; that is, Y = L(x).

where ∧ is the generalized conjunction operator.

Proof: Proof by induction is used. Basis case: Literals is either t or ¬t. If the formula is t, x |= t since t∈L(x). Otherwise, x |= ¬t since it is not the case that x |= t. Inductive case: Assume that x |= ϕ, where the length of formula ϕ is n. In case the length of formula is n+1, ϕ is either x |= ϕ ∧ t or x |= ϕ ∧ ¬t. According to the assumption, x |= ϕ. Thus we have to show either x |= t or x |= ¬t. But we have already proved them in the basis case. Q.E.D. Theorem 1 says local topics that is a property that holds in a place and computed easily. For example, local topics of x0 is, x0 |= a∧¬b∧c∧¬d∧¬e∧¬f∧¬g∧¬h∧¬i∧¬ j since Y = L(x0) = {a, c}

Definition 1: Place topics is a property that holds in every reachable place and computed as follows:

In the above definition, Q is the set of reachable places, ∨ is the generalized disjunction operator. In our example, place topics I is

Q = { x0,x1,x2, x3,x4,x5,x6,x8,x9,x11,x12,x15,x16}, I= x0:(a∧¬b∧c∧¬d∧¬e∧¬f∧¬g∧¬h∧¬i∧¬ j) …∨ x16:(¬a∧¬b∧c∧¬d∧¬e∧¬f∧¬g∧¬h∧i∧¬ j)

The length of place topics generated in this naïve way is n*m, provided |Q| = n and |T| = m. Although it contains a tremendous amount of useful information, it is likely to be too complex for the user to understand.

 

3. Generating topic based Model and analysis

3.1 Generating topics

Knowledge model(M) depending on the definition of the unit is supposed that every proposition is the topic. Some words in the document representation with a significant suppose a topic. If we represent a particular set of documents from the relationship between the feature keyword and topics to be able to abstract, this is a very simple but useful model is to gain knowledge. Topic as text based knowledge model could be set forth in detail generally as a formal language through insights by experts or be mechanically abstracted by text mining algorithm[16]. This study describes the algorithm to automatically generate knowledge elements by topic and the method to optimize integrated knowledge model.

Through natural language processing, we work activities that exist within specific processes and document analysis and text mining techniques to represent the document is extracted to terms with the characteristics of knowledge model(M) is mapped to an item. Document there is a repository of the address the process(P) is a detailed the task(T) name of directory to define that term for the extraction ‘Weight Entropy’ is shown below.

ai,j = Frequency of terms is presented in the doc., Gi = Collected in the doc., indicated the frequency of term, n = The number of collected doc., di = number of documents with the term, pi,j = ai,j / gi

Automatically extracted terms to find the usefulness of confidence intervals for the topic at creation of knowledge models in terms(Document Term) of the whole document, i calculated from the extracted terms (Detected Term) through analysis of the most appropriate derived values (Roll up Point).

AK : Author Keyword [{Termn}×X] == TM : Text Mining [{Termm}×(X+i)]

while Author Keyword : AK [{Termn}×X] == Text Mining : TM [{Termm}×(X+i)]

if AK ≠ TM, i = 100, m : Set of term in document

than (i = + 100), Result Return(Roll up rate) for(i) : Roll up

else (AK = TM | Roll up rate ≤ 60), Step(end)

In our experiment as shown in figure3, abstract words and keywords automatically extracted from the concordance between the level should be approximately 80-90% effective when seen as an extract of the entire vocabulary of about 15% of the number of its induction into the most appropriate range. In addition, more than 90% of the keyword automatically in order to induce more than 30% of the full term should be induced to have confirmed that. The performance of general algorithm as Bayesian and weighted formula (Weight Entropy) algorithm do not differ.

Therefore, how to extract the keywords from documents is very effective for summarizing and defining of complexity document and knowledge model. In this study as Fig. 4, i used effective index that it calculate relationships between topics on knowledge model. Topics for the relationship between knowledge model extracted from the topics on document for the Closeness and Centrality of topic can be obtained through calculation[17][18].

Fig. 4.Comparative results of weight entropy with general mining algorithms

3.2 Model Analysis

On generation of knowledge model, the structural traits are available through knowledge map and the relations between topics are available through approaches analysis of network model[19][20]. However, in order to understand the implied meaning of knowledge, the interaction and effects between topics should be analyzed in-depth as they are the meaning unit that consist the knowledge. In this chapter, Selton index was referred for using characteristics of connection degree, closeness, and between on certain topics. For the Selton index, the network based topic model was used for efficiency analysis of the topic, and this research analyzed the relations through the higher model which includes accessibility analysis and process. At first, the intensity of relations may be quantification based on the connection intensity between certain topics. Pi is the number of i in certain knowledge, and Pj is the number of j in certain knowledge, and Pij is the number of i and j in certain knowledge.

The connection relations are divided into degree, closeness and medication, and have absolute value and relative value. The degree of centrality has higher connection intensity when certain topics reveal in other major knowledge, and it has central role in the knowledge. The Closeness Centrality has central role when certain topics are close to other topics, and Dij is the shortest course which connects between i and j topics, and g is the number of overall node. Meanwhile, the Between Centrality is interpreted as central role when certain topics have more mediator roles in the relations between topics. gjk is the shortest course number between two topics (j and k), and gjk(i) is the frequency of passing topic i between two topics j and k(j≠k). And g is the number of overall node. Through centrality analysis between topics, I can interpret meaningful characteristics of the knowledge model, and expand the range with difference and connection between knowledge models.

A knowledge map takes a form of an topic in a reachable condition satisfying properties given by a knowledge model. Therefore, a knowledge map which satisfies different properties could unify knowledge networks as Figure 4 and its formula is as below:

Definition 2: Topic based knowledge map M is equivalence knowledge network N

Knowledge Map M → Knowledge Network N, M → N = {X→S, X0→S0, R, T→true, L→E}

iff Knowledge Network N1 = {S1, s0∈S1, R1, T1: ∀|E={true}, E1: ∪{}}

iff Knowledge Network Nm = {Sm, s0∈Sm, Rm, Tm: ∀|E={true}, Em: ∪{}}

then = {S1∪Sm, s0∈S1∪s0∈Sm, R1×Rm, T1∩Tm: ∀E={true}, ∪E1:{}∨Em:{}}

The knowledge network as Fig. 5, is utilized in analyzing figuration and relationship of complexity systems as well as multi-dimensional analysis. And supports analysis of structural networks and paths, relationships, and behaviors of knowledge or topics through changes and unification of a particular knowledge model[20][21].

Fig. 5.Translation from topic map to knowledge network and integration method

Definition 3: Anything properties Φ1 and Φ2 are satisfy knowledge network M

iff Knowledge Model → Knowledge Map(Φ1) ∨ Knowledge Map(Φ2)⊆M

iff Knowledge Map → Knowledge Network : M"

Especially if i could be used to understand for the impact of the various topics and can get a variety of implications. In addition, as shown in Fig. 6, I created a visual analysis tool using the knowledge model structural analysis and hierarchical relationships between topics and took advantage of understanding.

Fig. 6.Visualization and analysis of unified knowledge network

 

4. Conclusion

Many researchers have investigated useful ways of verifying and validating knowledge based model for ontologies and rules, but it is not easy to directly apply them to checking process models. The goal of our research is to generate a set of place topic in given knowledge model, to understand the relation between complex knowledge map and analytical knowledge networks. In previous works every reachable place was to be considered to generate place topic. The complexity of place topic is strongly dependent upon the size of places to be considered. Given a scope in CTL, forward reachability analysis is used to find out a set of states inside it. Obviously, a set of places calculated in this way is a subset of every reachable place. Therefore, generated place topic is used to optimize and translate knowledge map and networks. In this paper, I hoped to generate simple knowledge model for easy interpret and understanding of technology analysts. In this regard, we have suggested extraction algorithm of semantic keyword from complex technical document. To do this, we have defined process model of technology innovation, extracted semantic keyword and relationship property from technical document of each process. The extracted keywords and property is utilized to create and analyze knowledge model.

The keyword in knowledge model is a property and semantic topics that holds in every reachable process on knowledge model. Not only can be used in understanding and analysis of complex knowledge’s relationships. For doing so, we let the user focus on some interested parts rather than a whole process state in a semantic model. Obviously, a set of process calculated in this way is a subset of every reachable process. Therefore, generated process and keyword is used to optimize and translate knowledge model. In future research, effective integration method of the extracted knowledge model and it based innovation trends will be applied actual system.

References

  1. Foray, Dominique, and B. A. Lundvall, “The Knowledge-Based Economy,” OECD, Paris, 1997.
  2. S. Haag, Baltzan, “Business Driven Technology,” Published by McGraw-Hill, 2006.
  3. H. Shah, Kourdi, "Frameworks for Enterprise Architecture," IEEE computer society, IT Professional, pp. 36-41, September, 2007. Article (CrossRef Link).
  4. Scholer, A., Rickel, "Learning domain knowledge for teaching procedural task," AAAI-2000, 2000.
  5. David Y.W. Park, "Static Analysis to Identify Invariants in RSML Specifications," in Proc. of Formal Techniques in Real-Time and Fault-Tolerant '98, LNCS 1486, pp. 133-142, Springer, 1998. Article (CrossRef Link).
  6. M. D. Ernst,(ed.), "Dynamically Discovering Likely Program Invariants to Support Program Evolution," ICSE, 2000.
  7. M. B. Dwyer, "Property specification patterns for finite-state verification," in Proc. of the Workshop on Formal Methods in Software Practice, 1998. Article (CrossRef Link).
  8. William Chan, "Temporal-logic Queries," in Proc. of CAV 2000, LNCS 1855, pp. 450-463, Springer, 2000. Article (CrossRef Link).
  9. R. Bryant, “Graph-Based Algorithms for Boolean Function Manipulation,” IEEE Transactions on Computer, Vol. 100, No. 8, pp. 677-691, 1986. Article (CrossRef Link). https://doi.org/10.1109/TC.1986.1676819
  10. W.Y. Liu, "Using IDEF0/Petri net for Ontology based task knowledge analysis: the case of emergency response for debris flow," System Sciences, HICSS Volume 4, 2006. Article (CrossRef Link).
  11. S. Edelkamp, S. Jabbar, “Action Planning for Directed Model Checking of Petri Nets,” Electronic Notes in Theoretical Computer Science, Vol. 149, No. 2, pp. 3-18, 2006. Article (CrossRef Link). https://doi.org/10.1016/j.entcs.2005.07.023
  12. J. B. Rosser, “On the Complexities of Complex Economic Dynamics,” Journal of Economic Perspectives. Vol. 13, No. 4, pp. 169-192, 1999. Article (CrossRef Link). https://doi.org/10.1257/jep.13.4.169
  13. D. Ruan, F. Hardeman, “Intelligent Decision and Policy Making Support Systems,” Vol. 117, Springer, 2008. Article (CrossRef Link).
  14. Robert J. Thierauf, “Optimal Knowledge Management,” IDEA Group Publishing, 2006. Article (CrossRef Link).
  15. V. Devedzic, “Knowledge Modeling - State of the art,” ACM integrated computer aided engineering, Vol. 8, No. 3, pp. 257-281, 2001.
  16. Edmund M. Clarke, "Model Checking," in Procs of the 17th Conference on Foundations of Software Technology and Theoretical Computer Science, pp. 54-56, 1997.
  17. A. Neuser, “A GUI for Real-time Visualization of On-line Model Checking with UPPAAL,” Institute for Software Systems, Hamburg University of Technology (TUHH), 2014.
  18. Savas Konur, “Towards Light-Weight Probabilistic Model Checking,” Journal of Applied Mathematics, Hindawi publishing corporation, 2014. Article (CrossRef Link).
  19. Dou, Wei, “A Model-Driven Approach to Trace Checking of Temporal Properties with OCL,” Open Repository and Bibliography, S&T Centre, University of Luxembourg, 2014.
  20. Campos, J. Creissac, Machado, “A specification patterns system for discrete event systems analysis,” InTech, repositorium.sdum.uminho.pt, 2013. Article (CrossRef Link).
  21. DR Kulkarni, “Improved model generation and property specification for analog/mixed-signal circuits,” async.ece.utah.edu, 2013.