Associating Absent Frequent Itemsets with Infrequent Items to Identify Abnormal Transactions


Li-Jen Kao
Department of Computer Science and Information Engineering
Hwa Hsia Institute of Technology
New Taipei City, Taiwan
ljenkao@cc.hwh.edu.tw

Yo-Ping Huang*
Department of Electrical Engineering
National Taipei University of Technology
Taipei, Taiwan
yphuang@ntut.edu.tw
*corresponding author

Frode Eika Sandnes
Institute of Information Technology
Faculty of Technology, Art and Design
Oslo and Akershus University College of Applied Sciences
Oslo, Norway
Frode-Eika.Sandnes@hioa.no

Abstract

Data stored in transactional databases are vulnerable to noise and outliers, which are often discarded at an early stage of data mining. Abnormal transactions in a marketing transactional database are those transactions that should contain some items but do not. However, some abnormal transactions may provide valuable information in the knowledge mining process. The literature on how to efficiently identify abnormal transactions in a database, as well as determine what causes the transactions to be abnormal, is scarce. This paper proposes a framework to identify abnormal transactions as well as the items that induce them. Results from one synthetic and two medical data sets are presented and compared with previous work to verify the effectiveness of the proposed framework.

Keywords: data mining; abnormal transactions; absent frequent itemset; infrequent items; association rules.

1 Introduction

Data mining is an emerging technology used to discover interesting patterns from large databases. In the past, most efforts reported in the literature focused on developing efficient methods to find association rules [5, 9-11, 14, 30, 32]. Recently, outlier detection has attracted attention due to its importance in detecting deviant data. Several applications rely on outlier detection for the discovery of vital information, such as credit card fraud detection, network intrusion detection, abnormal numeric values in stock prices, and disease symptom diagnosis [1-2, 6-7, 12, 17-18, 28, 30-31, 34-36]. Though some breakthroughs have been reported on outlier detection, there remain critical issues to be resolved.
First, most outlier detection algorithms are designed for numerical data and rely on computing the relative distance between data points. These algorithms are not suitable for datasets with categorical attributes [17]. The following example illustrates why distance measuring methods fail to detect outliers in categorical datasets. Table 1 contains 10 transactions that can be divided into 3 types, i.e., {item1, item2}, {item1} and {item2}. If the minimum support and minimum confidence are set to 50% and 80%, respectively, the rule item1 → item2 will be an association rule, whereas the rule item2 → item1 will not. According to the derived association rule, if someone buys item1, it is very likely that they will buy item2 at the same time. That is, a transaction with only item1 may be an outlier, but not a transaction with only item2. However, Fig. 1 shows that if the data point (item1, item2) is the center of a cluster and (item1) is an outlier, then (item2) should be an outlier, too. In other words, if transactions with only item1 are possible outliers, then transactions with only item2 should also be possible outliers according to distance. With higher dimensionalities, more transactions will be incorrectly classified as outliers.

Many commercial applications rely on marketing transactional databases with both categorical and numerical attributes. Finding outlier transactions in such databases is important since outliers may, for instance, affect marketing management or sales strategies. Only a handful of studies have focused on the detection of outlier transactions from categorical datasets or transactional databases [15-16, 18, 21, 27]. He et al. [16] proposed an entropy-based method to detect outliers. The FindFPOF (Frequent Pattern Outlier Factor) algorithm [17] is another well-known item-based outlier detection technique. He, Xu and Deng [17] defined an outlier transaction as a transaction with few frequent patterns. FindFPOF first discovers frequent itemsets and then finds outliers by comparing each transaction with every frequent itemset. The drawback of this algorithm is that its efficiency deteriorates as the number of frequent itemsets increases. Narita and Kitagawa [26] proposed another item-based approach where outliers are assumed to be transactions that violate most association rules.
Narita and Kitagawa's work shared similarities with that of He, Xu and Deng [17], but they reduced the search space to expedite the search for outliers in large datasets.

Table 1. Transactional database sample 1. TID: Transaction IDentification number.

TID   Items
1     item1, item2
2     item1, item2
3     item1, item2
4     item1, item2
5     item1, item2
6     item1
7     item2
8     item2
9     item2
10    item2

Fig. 1. The relationship between data point (item1, item2).

Secondly and most importantly, the attempts documented in the literature did not offer any suggestions on what caused the transactions to become abnormal. In fact, the abnormal transactions by themselves provide little useful information for decision making. For example, assume that an association rule r, {Jam, Milk} → {Bread}, is derived from a transactional database D with a high confidence value of 80%. According to the association rule r, the transaction <Bacon, Corn, Jam, Milk> may be abnormal due to the absence of Bread. Knowing only that a certain item is absent is of little benefit. However, finding the reasons why outliers occur would help us make better decisions. There may be a variety of reasons behind the abnormal transactions. For example, a customer may want to buy bread, but found that he or she did not bring enough cash. In this example it is not easy to explore the underlying reasons for not buying bread. But if the reason is that the emergence of some items leads to the disappearance of some other items, then finding the reason is transformed into the issue of identifying which items cause the absence of other items, and this can be resolved by using the proposed method. In the aforementioned example, could Bacon or Corn be the item that makes Bread absent? The idea of identifying which items cause the absence of other items is practical: if users can apply association rules to find the relationships between items, they can also use association rules to find the relationships between items and absent items.

This paper proposes a framework for identifying the outlier transactions in marketing databases and finding which items may cause transactions to become outliers. The framework is divided into two parts. The first part of this study utilizes association rules to efficiently identify abnormal transactions in a database. An abnormal transaction is defined as a transaction that is expected to contain some items that actually do not appear. Those items that should have been contained are marked as absent items.
Absent items themselves hardly provide any value in decision making unless the reasons that cause the items' absence can be found. The second part of this study uses an association rules mining algorithm to extract the relationship between absent frequent items and infrequent items. Typically, the infrequent items that are always ignored in association rules studies may be the key cause of abnormal transactions [13]. Our approach is to transform each transaction into absent frequent itemsets and infrequent items. These transformed transactions can be mined by employing an association rules mining algorithm to find the relationship between infrequent items and absent items.

The remaining sections of this paper are organized as follows. Section 2 introduces related work on outlier detection. Section 3 describes the proposed method and Section 4 identifies items that induce abnormal transactions. Section 5 provides experimental evidence. Section 6 concludes the paper.

2 Related work

2.1 Frequent itemsets and association rules

A frequent itemset is an itemset that is contained in at least a certain number of transactions. Association rules can be derived from frequent itemsets. The well-known association rule example derived from supermarket shopping data is {diaper} → {beer}, which means people buying diapers will also buy beer at the same time. Association rules help businesses plan proper strategies to increase their sales. The following is a brief description of how association rules are found based on a transactional database.

Let $I$ be the set of all items. A transactional database $D$ is a set of transactions where each transaction $t$ is a set of items such that $t \subseteq I$. The cardinality of the database $D$ is denoted by $|D|$. For two itemsets $X, Y \subseteq I$ with $X \cap Y = \emptyset$, the rule $X \rightarrow Y$ means that if $X$ occurs then $Y$ also occurs. The support of an itemset $X$, denoted by support(X), is the fraction of transactions that contain $X$:

$$\mathrm{support}(X) = \frac{|\{t \in D : X \subseteq t\}|}{|D|}. \quad (1)$$

An itemset is frequent if its support is larger than or equal to a pre-defined support threshold min_sup. The confidence of $X \rightarrow Y$ is defined as confidence(X → Y):

$$\mathrm{confidence}(X \rightarrow Y) = \frac{\mathrm{support}(X \cup Y)}{\mathrm{support}(X)}. \quad (2)$$

An association rule is a rule with its confidence larger than or equal to a pre-defined threshold min_conf. The Apriori-based algorithm is usually adopted for mining association rules. The original Apriori-based algorithm is inefficient because it repeatedly scans the same database to find frequent itemsets. Various non-Apriori methods have been proposed to expedite the discovery of association rules [14, 20, 33]. FP-growth [14], a well-known non-Apriori association rules mining algorithm, scans the database twice to build an FP-tree where all the frequent itemsets are stored. Each branch in the FP-tree is a frequent itemset. The association rules are then mined from the FP-tree. Since the FP-tree is a compact structure, its performance is better compared to the Apriori family of algorithms [11, 19].

2.2 Maximal frequent itemsets

The FP-tree structure is not only used to generate association rules but is also a good data structure for applications that only need the information of frequent itemsets. However, if there are many long transaction patterns or the minimum support setting is low, the number of frequent itemsets and the FP-tree storage will be huge [11]. In this case, one can consider getting maximal frequent itemsets instead of frequent itemsets. A frequent itemset $X$ is a maximal frequent itemset (MFI) if there is no other frequent itemset $Y$ such that $X \subset Y$. Any subset of a maximal frequent itemset is a frequent itemset; that is, one can still get frequent itemsets from maximal frequent itemsets. Since the total number of maximal frequent itemsets is smaller than that of frequent itemsets, storage requirements are reduced. Several algorithms, such as MAFIA [5], GenMax [9] and FPmax [10], find maximal frequent itemsets.
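Equations (1) and (2) and the definition of a maximal frequent itemset can be checked on the two small sample databases of this paper. The following Python sketch is only illustrative: it enumerates itemsets by brute force rather than with FP-growth or FPmax, the function names are hypothetical, and the second item of TID 3 in Table 2 is assumed to be "i".

```python
from fractions import Fraction
from itertools import combinations

def support(itemset, db):
    """Eq. (1): fraction of transactions that contain every item of `itemset`."""
    itemset = frozenset(itemset)
    return Fraction(sum(itemset <= t for t in db), len(db))

def confidence(x, y, db):
    """Eq. (2): support(X u Y) / support(X)."""
    return support(frozenset(x) | frozenset(y), db) / support(x, db)

def maximal_frequent_itemsets(db, min_sup):
    """Brute-force MFI search; only suitable for tiny examples."""
    items = sorted({i for t in db for i in t if support({i}, db) >= min_sup})
    frequent = [frozenset(c)
                for k in range(1, len(items) + 1)
                for c in combinations(items, k)
                if support(c, db) >= min_sup]
    # Keep only itemsets that have no frequent proper superset.
    return {x for x in frequent if not any(x < y for y in frequent)}

# Table 1: five {item1, item2}, one {item1}, four {item2}.
table1 = ([frozenset({"item1", "item2"})] * 5
          + [frozenset({"item1"})]
          + [frozenset({"item2"})] * 4)

# Table 2 (TID 3's second item is an assumption).
table2 = [frozenset(t) for t in
          ("abcefo", "acg", "ei", "acdeg", "acegl",
           "ej", "abcefp", "acd", "acegm", "acegn")]

conf_1_to_2 = confidence({"item1"}, {"item2"}, table1)  # 5/6, above 80%
conf_2_to_1 = confidence({"item2"}, {"item1"}, table1)  # 5/9, below 80%
mfis = maximal_frequent_itemsets(table2, Fraction(20, 100))
```

Using exact fractions avoids floating-point surprises when a support value sits exactly on the threshold, as {a, c, d} does at 20% in Table 2.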
FPmax is based on FP-growth and is proven to be a competitive algorithm [19]. FPmax builds an FP-tree-like structure, called an MFI-tree, to keep track of all maximal frequent itemsets. Subsequent research has proposed more effective algorithms for acquiring maximal frequent itemsets; however, since they are extensions of FP-growth and have huge storage requirements, the algorithm proposed herein employs FPmax to find maximal frequent itemsets.

The following example illustrates how FPmax finds maximal frequent itemsets [10]. Table 2 lists a sample database that contains 10 transactions. The minimum support is set to 20%. Fig. 2 shows the final complete FP-tree. If an FP-tree has only one path, it is an MFI-tree. Since the FP-tree in Fig. 2 is not a single-path tree, we first find a conditional pattern base and conditional FP-tree for each item in the header table. For example, the corresponding conditional FP-tree of item f is shown in Fig. 3. The items in the conditional pattern base are listed in descending order according to frequency. Note that if the conditional FP-tree of an item has more than one path, the FP-tree needs to be separated into several single-path trees. The initial MFI-tree only contains the header table. The conditional FP-tree of the first item, f, is inserted into the MFI-tree, and the itemset {a, c, e, b, f} is a maximal frequent itemset. The following step involves checking whether the conditional FP-tree of item d, {a, c, d}, is a subset of any maximal frequent itemset in the MFI-tree. If it is not a subset, it is inserted into the MFI-tree. Next, the conditional FP-tree of item b, {a, c, e, b}, is a subset of {a, c, e, b, f} and will not be inserted. This subset-checking step is repeated until all the items in the header table are processed. Fig. 4 shows the complete MFI-tree.

Table 2. Transactional database sample 2.

TID   Items
1     a, b, c, e, f, o
2     a, c, g
3     e, i
4     a, c, d, e, g
5     a, c, e, g, l
6     e, j
7     a, b, c, e, f, p
8     a, c, d
9     a, c, e, g, m
10    a, c, e, g, n

Fig. 2. The complete FP-tree derived from Table 2. (Node labels: root, a:8, e:2, c:8, e:6, g:1, d:1, b:2, g:4, f:2, d:1.)

Fig. 3. The conditional FP-trees for item f in header table. (Node labels: root, a, c, e, b.)

Fig. 4. The complete MFI-tree for the dataset in Table 2. (Node labels: root, f, a, c, e, d, b, g, f.)

2.3 The definition of outlier transactions

There is no definitive definition of an outlier. Different applications define outliers differently and use different outlier detection approaches. Statistical methods fit the dataset to assumed distributions, and data are determined to be outliers according to how well they fit the assumed distribution [17]. However, the underlying distribution of a certain dataset may not match the assumed distribution, which consequently affects the outlier detection accuracy. Another problem involves datasets with high dimensionality, as it is difficult to estimate multidimensional distributions [22]. Distance-based methods define a data point p in a set D as an outlier if a certain percentage of the other points in D are more than a pre-defined distance away from p [4]. Approaches using this definition [1, 34] have the drawback of high computational complexity when processing large datasets, and they make it difficult to find local outliers [22]. Clustering methods are also used to identify outliers, that is, points that are not inside any of the clusters [2, 3, 8, 24, 29]. Cluster-based methods first identify the clusters, thus the efficiency depends on how clusters are formed. Density-based methods [18] identify outliers by comparing densities within the input dataset, and consider outliers to be points lying in low-density regions.
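The distance-based definition can be tried on the Table 1 data to see why it cannot separate the two kinds of single-item transactions. The sketch below is a hedged illustration: the binary encoding (has_item1, has_item2), the Euclidean metric and the thresholds `pct=0.6` and `dmin=0.5` are assumptions chosen for the example, not values from the source.

```python
import math

# Table 1 encoded as binary vectors (has_item1, has_item2); assumed encoding.
POINTS = [(1, 1)] * 5 + [(1, 0)] + [(0, 1)] * 4

def is_distance_outlier(index, points, pct, dmin):
    """True if at least `pct` of the other points lie farther than `dmin` away."""
    p = points[index]
    others = [q for j, q in enumerate(points) if j != index]
    far = sum(math.dist(p, q) > dmin for q in others)
    return far / len(others) >= pct

# Both single-item transactions are equally far from the cluster center (1, 1).
center = (1, 1)
d_item1_only = math.dist((1, 0), center)
d_item2_only = math.dist((0, 1), center)

# Hypothetical thresholds: 60% of the other points farther than 0.5.
flags = {
    "item1,item2": is_distance_outlier(0, POINTS, pct=0.6, dmin=0.5),
    "item1": is_distance_outlier(5, POINTS, pct=0.6, dmin=0.5),
    "item2": is_distance_outlier(6, POINTS, pct=0.6, dmin=0.5),
}
```

With these settings the distance test flags the item1-only and the item2-only transactions alike, whereas the association rule item1 → item2 singles out only the item1-only transaction.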

All the mentioned methods are intended for datasets with numerical attributes and rely on data point distance measures to determine outliers. A marketing transactional database, a dataset with multiple categorical attributes, is usually employed by commercial applications to record data. In such a multi-dimensional dataset, the concept of proximity may not be meaningful [23]. That is, an outlier transaction is not a data point; therefore, one cannot use distance measures to determine whether it is an outlier. The aforementioned methods are also not suitable for detecting outlier transactions, even if some of the methods map the categorical attributes to numerical attributes before distances between data points are computed. The approach still faces the problem that the mapping results are not consistent across different mapping orderings.

Only a few studies have focused on identifying outliers from transactional datasets [15-16, 27]. He et al. assumed that transactions containing fewer frequent itemsets are more likely to be outlier transactions [17]. They defined the Frequent Pattern Outlier Factor (FPOF) to evaluate whether a transaction is an outlier or not. Narita and Kitagawa were interested in assessing whether a transaction is likely to be an outlier when some items are supposed to appear, but actually do not appear [26]. Based on this concept, an outlier degree is defined to evaluate whether a single transaction is an outlier or not. According to their experiments, Narita and Kitagawa claim that their approach can derive more accurate results compared to other approaches such as [17].

The following example illustrates the concept of outlier transactions [26]. In order to derive association rules from the transactional database in Table 3, the minimum support and minimum confidence are set to 50% and 80%, respectively. Table 4 gives partial association rules generated from Table 3. Since all the rules in Table 4 have high confidence, we see that TID 2 <Bacon, Corn, Jam, Milk> is abnormal. By checking RID 2, this transaction does not include the item Bread that is supposed to appear in the transaction. In fact, TID 2 is an outlier according to [26].

Table 3. Transactional database sample 3.
TID   Items
1     Bread, Jam, Milk
2     Bacon, Corn, Jam, Milk
3     Bread, Jam, Milk
4     Bacon, Bread, Corn, Egg, Milk
5     Bacon, Bread, Corn, Egg, Jam, Milk
6     Bread, Corn, Jam, Milk
7     Bacon, Bread, Egg, Milk
8     Bacon, Bread, Egg, Jam, Milk
9     Bread, Jam, Milk
10    Bacon, Egg, Milk

Table 4. Association rules derived from Table 3. RID: association Rules IDentification number.

RID   Rule
1     {Jam} → {Bread}
2     {Jam, Milk} → {Bread}
3     {Jam} → {Bread, Milk}
4     {Bacon} → {Egg}
5     {Bacon, Milk} → {Egg}
6     {Bacon} → {Egg, Milk}
7     {Milk} → {Bread}

2.4 The approach to detect abnormal transactions

This study aims to find the items which cause transactions to be abnormal. From the perspective of outlier management, we can proceed by identifying the outlier transactions, analyzing the causality among items and then discovering the reasons behind the abnormality. Thus, our proposed abnormal transaction detection model starts by defining what abnormal transactions are. Then, we propose our new findings on detecting abnormal transactions as well as on identifying items that induce abnormal transactions. The remainder of this section introduces the definitions used to derive the outlier degree.

Definition 1. Let $t$ be a transaction, $e$ be an item, and $R$ be the set of high confidence association rules. $t$'s associative closure $t^{+}$ is defined as follows:

$$t^{0} = t, \qquad t^{i+1} = t^{i} \cup \{e \mid e \in Y \text{ and } X \subseteq t^{i} \text{ and } X \rightarrow Y \in R\}.$$

The itemset $t^{i+1}$ includes the items that should appear in $t^{i}$ but actually do not. The sequence converges when $t^{i}$ has no more items that should appear but actually do not appear, and the converged itemset is $t^{+}$. The associative closure $t^{+}$ is an ideal form of $t$ and does not violate any association rule.

Definition 2. Let $t$ be a transaction, $R$ be the set of high confidence association rules, and $t^{+}$ be the associative closure of $t$. The outlier degree of $t$ is defined as od(t):

$$od(t) = \frac{|t^{+} - t|}{|t^{+}|}. \quad (3)$$

The outlier degree value is in the range between 0 and 1. For example, the associative closure $t^{+}$ for TID 2 in Table 3 is <Bacon, Corn, Bread, Egg, Jam, Milk>. The outlier degree for TID 2 is therefore equal to 2/6. Note that if $t^{+}$ is equal to $t$, the outlier degree od(t) is equal to 0.

Definition 3. An outlier transaction is a transaction with an outlier degree od(t) greater than or equal to min_od, a pre-defined outlier degree threshold.

If min_od is set to 0.3, TID 2 in Table 3 is an outlier. Since the efficiency of the algorithm deteriorates as the transactional database grows, there is a need to improve the algorithm to reduce the time complexity. The basic idea is to reduce the size of both the transactional database and the set of association rules.

Definition 4. Let $M$ be the set of all maximal frequent itemsets and $m$ be a maximal frequent itemset with $m \in M$. A transaction $t$'s maximal associative closure $t^{+M}$ is defined as follows:

$$t^{0} = t, \qquad t^{i+1} = t^{i} \cup \{e \mid e \in m \text{ and } m \cap t^{i} \neq \emptyset\}.$$

Definition 5. Let od(t) be $t$'s outlier degree. The upper bound of od(t), denoted $\overline{od}(t)$, is derived as follows:

$$\overline{od}(t) = \frac{|t^{+M} - t|}{|t^{+M}|}. \quad (4)$$

If the upper bound of a transaction's outlier degree is less than min_od, then the transaction cannot be an outlier. Instead of using the association rules set to identify outliers, one can first utilize the maximal frequent itemsets, which have a comparatively smaller data size, to calculate each transaction's upper bound of outlier degree and then prune transactions with upper bounds less than min_od. This helps to reduce the transaction set, and the outlier degree is only computed for the remaining transactions. Consequently, the outlier degree calculation efficiency is significantly improved.

Definition 6. An association rule X → Y is a non-redundant rule if there are no other rules Z → W and S → V such that () and (), respectively.
According to Definition 6, RID 1, 2 and RID 4, 5 in Table 4 are redundant since they can be described by RID 3 and RID 6, respectively. The redundant rules can be removed from the original association rules set, and the set of all non-redundant rules is denoted as the minimal rules set $R_{min}$. The size of $R_{min}$ is smaller than that of the original association rules set, and a transaction's associative closure derived from the association rules set is the same as the associative closure derived from the minimal rules set. Definition 6 and the rules given in Table 4 indicate that the efficiency of the outlier discovery algorithm is improved when the number of association rules is reduced.
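As a minimal sketch of Definitions 1 to 5, the following Python code computes the associative closure, the outlier degree of Eq. (3) and its upper bound of Eq. (4) for transactions of Table 3. The rule list is copied from Table 4. The two maximal frequent itemsets in `MFIS` were worked out by hand from Table 3 at 50% minimum support and are not listed in the source, and all function names are illustrative.

```python
from fractions import Fraction

# Table 4 as (antecedent, consequent) pairs.
RULES = [
    ({"Jam"}, {"Bread"}),
    ({"Jam", "Milk"}, {"Bread"}),
    ({"Jam"}, {"Bread", "Milk"}),
    ({"Bacon"}, {"Egg"}),
    ({"Bacon", "Milk"}, {"Egg"}),
    ({"Bacon"}, {"Egg", "Milk"}),
    ({"Milk"}, {"Bread"}),
]
# Assumption: MFIs of Table 3 at min_sup = 50%, derived by hand.
MFIS = [{"Bread", "Jam", "Milk"}, {"Bacon", "Egg", "Milk"}]

def associative_closure(t, rules):
    """Definition 1: add consequents of applicable rules until nothing changes."""
    closure = set(t)
    changed = True
    while changed:
        changed = False
        for x, y in rules:
            if x <= closure and not y <= closure:
                closure |= y
                changed = True
    return closure

def maximal_associative_closure(t, mfis):
    """Definition 4: add every MFI that shares an item with the current itemset."""
    closure = set(t)
    changed = True
    while changed:
        changed = False
        for m in mfis:
            if m & closure and not m <= closure:
                closure |= m
                changed = True
    return closure

def degree(t, closure):
    """Eqs. (3) and (4): share of the closure that is missing from t."""
    return Fraction(len(closure - set(t)), len(closure))

tid1 = {"Bread", "Jam", "Milk"}
tid2 = {"Bacon", "Corn", "Jam", "Milk"}
od_tid2 = degree(tid2, associative_closure(tid2, RULES))         # 2/6
ub_tid2 = degree(tid2, maximal_associative_closure(tid2, MFIS))  # 2/6
od_tid1 = degree(tid1, associative_closure(tid1, RULES))         # 0
ub_tid1 = degree(tid1, maximal_associative_closure(tid1, MFIS))  # 2/5
```

The loop applies rules one at a time rather than in the synchronous rounds of Definition 1; both reach the same fixed point. TID 1 shows that the upper bound can be loose: it is 2/5 although the outlier degree itself is 0.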

3 Infrequent items and outlier degree

Outlier degree is a measurement of how many frequent items are absent from a specific transaction. It determines how completely possible outlier transactions are detected. Looking back at Eq. (3), one can find that the infrequent items also affect the outlier degree calculation. We will show that there is no need to take infrequent items into consideration when calculating the outlier degree.

Definition 7. Infrequent items are items that are not contained in any frequent itemsets.

The infrequent items in the transaction set in Table 5 are Battery and Corn, using the same minimum support and minimum confidence as in the previous example. Compared with TID 2 in Table 3, TID 2 in Table 5 has only one additional infrequent item, namely Battery. The associative closure for TID 2 is <Bacon, Corn, Jam, Milk, Battery, Bread, Egg>, and the outlier degree is equal to 2/7. If the minimum outlier degree is set to 0.3, this transaction is no longer an outlier. But according to [26], an outlier is a transaction with some items that are expected to appear, but do not. TID 2 in Table 5 is effectively the same as TID 2 in Table 3, and should be an outlier. The problem arises from the definition of associative closure, under which the more infrequent items a transaction has, the more normal the transaction appears. The outlier degrees of TID 2 and TID 10 in Table 5 are influenced by the number of infrequent items, and there are no outliers if the minimum outlier degree is set to 0.3. Based on this observation, one should remove infrequent items from the transactions before calculating the outlier degree. This follows from the fact that the outlier degree is used to indicate how many frequent items are missing; therefore, the infrequent items should not be considered in calculating the outlier degree.

Table 5. Transactional database sample 4.
TID   Items                                    Infrequent items   od(t)
1     Bread, Jam, Milk                                            0
2     Battery, Bacon, Corn, Jam, Milk          Battery, Corn      2/7
3     Bread, Jam, Milk                                            0
4     Bacon, Bread, Corn, Egg, Milk            Corn               0
5     Bacon, Bread, Corn, Egg, Jam, Milk       Corn               0
6     Bread, Corn, Jam, Milk                   Corn               0
7     Bacon, Bread, Egg, Milk                                     0
8     Bacon, Bread, Egg, Jam, Milk                                0
9     Bread, Jam, Milk                                            0
10    Battery, Bacon, Egg, Milk                Battery            1/5

The conventional method of calculating the outlier degree cannot truly reflect the role of outliers in transactions. We therefore redefine a transaction's associative closure and its maximal associative closure to discover the outlier transactions.

Definition 8. Let $t$ be a transaction, $R$ be the set of high confidence association rules, and $I_r$ be the set of all infrequent items. $t^{-}$ denotes $t$'s frequent transaction, obtained by removing all the infrequent items from $t$. $t$'s associative closure $t^{+}$ is defined as follows:

$$t^{0} = t^{-} = t - \{e \mid e \in I_r\}, \qquad t^{i+1} = t^{i} \cup \{e \mid e \in Y \text{ and } X \subseteq t^{i} \text{ and } X \rightarrow Y \in R\}.$$

Definition 9. Let $t$ be a transaction, $R$ be the set of high confidence association rules, and $t^{+}$ be the associative closure of $t$. The new outlier degree of $t$ is defined as od(t):

$$od(t) = \frac{|t^{+} - t^{-}|}{|t^{+}|}. \quad (5)$$
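The effect of removing infrequent items can be illustrated on TID 2 and TID 10 of Table 5. The sketch below compares the outlier degree of Eq. (3) with one reading of Definitions 8 and 9, in which the closure is built from $t^{-}$ and the degree is $|t^{+} - t^{-}| / |t^{+}|$. The rule set is assumed to be the non-redundant rules RID 3, 6 and 7 of Table 4, and the function names are hypothetical.

```python
from fractions import Fraction

# Assumed minimal rule set: RID 3, 6 and 7 of Table 4.
R_MIN = [
    ({"Jam"}, {"Bread", "Milk"}),
    ({"Bacon"}, {"Egg", "Milk"}),
    ({"Milk"}, {"Bread"}),
]
INFREQUENT = {"Battery", "Corn"}

def closure(items, rules):
    """Add consequents of applicable rules until a fixed point is reached."""
    result = set(items)
    changed = True
    while changed:
        changed = False
        for x, y in rules:
            if x <= result and not y <= result:
                result |= y
                changed = True
    return result

def conventional_od(t):
    """Eq. (3): infrequent items stay in the transaction and in the closure."""
    c = closure(t, R_MIN)
    return Fraction(len(c - set(t)), len(c))

def proposed_od(t):
    """Eq. (5) as read here: infrequent items are removed before the closure."""
    t_minus = set(t) - INFREQUENT
    c = closure(t_minus, R_MIN)
    return Fraction(len(c - t_minus), len(c))

tid2 = {"Battery", "Bacon", "Corn", "Jam", "Milk"}
tid10 = {"Battery", "Bacon", "Egg", "Milk"}
min_od = Fraction(3, 10)

old = {2: conventional_od(tid2), 10: conventional_od(tid10)}  # 2/7 and 1/5
new = {2: proposed_od(tid2), 10: proposed_od(tid10)}          # 2/5 and 1/4
```

Under this reading, TID 2 rises from 2/7 to 2/5 and therefore passes the 0.3 threshold, while TID 10 rises from 1/5 to 1/4 and stays below it.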

Definition 10. Let $M$ be the set of all maximal frequent itemsets and $m \in M$. A transaction $t$'s maximal associative closure $t^{+M}$ is defined as follows:

$$t^{0} = t^{-} = t - \{e \mid e \in I_r\}, \qquad t^{i+1} = t^{i} \cup \{e \mid e \in m \text{ and } m \cap t^{i} \neq \emptyset\}.$$

Definition 11. Let od(t) be $t$'s outlier degree. The upper bound of od(t), denoted $\overline{od}(t)$, is derived as follows:

$$\overline{od}(t) = \frac{|t^{+M} - t^{-}|}{|t^{+M}|}. \quad (6)$$

Fig. 5 shows the proposed outlier degree algorithm.

1. Get the association rules set R from a transactional database D by employing the FP-growth algorithm.
2. Get the maximal frequent itemsets set M by employing the FPmax algorithm.
3. Reduce the size of the transactional database. Get each transaction t's frequent transaction t- and then calculate its outlier degree upper bound. Remove transactions whose upper bounds are less than min_od. The remaining transactions are the candidates of outlier transactions. The remaining transactions set is denoted as D_min.
4. Reduce the size of the association rules set. Remove redundant rules from R and get the minimal association rules set R_min.
5. Get the outlier transactions set OT.
   For each t in D_min
      Get t's frequent transaction t- and its associative closure t+ by checking R_min.
      Calculate its outlier degree od(t).
      If od(t) >= min_od then OT = OT ∪ {t-}

Fig. 5. The proposed outlier degree algorithm.

4 Finding items that make transactions abnormal

The proposed outlier degree measurement method allows us to identify abnormal transactions. However, what is the benefit of discovering outliers? Can the discovery of outliers provide valuable information to further improve decision making? Usually, results from data mining help users realize unknown but important facts, and users can utilize these facts to make better decisions. For example, the famous association rule {diapers} → {beers}, mined from real store databases, shows that those who purchase diapers tend to also buy beer when they go grocery shopping. Based on this observation, retailers stock diapers next to the beer coolers to increase revenues. Intrusion detection, another outlier mining example, provides time series patterns to help users predict possible intrusion events. In contrast, while an abnormal transaction may be detected, there is nothing we can do about it.
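Step 3 of Fig. 5, pruning by the upper bound of Definitions 10 and 11, can be sketched on the Table 5 data as follows. This is an illustration under stated assumptions: the maximal frequent itemsets in `MFIS` were derived by hand for Table 5 at 50% minimum support, the upper bound is read as $|t^{+M} - t^{-}| / |t^{+M}|$, and the function names are hypothetical.

```python
from fractions import Fraction

TABLE5 = {
    1: {"Bread", "Jam", "Milk"},
    2: {"Battery", "Bacon", "Corn", "Jam", "Milk"},
    3: {"Bread", "Jam", "Milk"},
    4: {"Bacon", "Bread", "Corn", "Egg", "Milk"},
    5: {"Bacon", "Bread", "Corn", "Egg", "Jam", "Milk"},
    6: {"Bread", "Corn", "Jam", "Milk"},
    7: {"Bacon", "Bread", "Egg", "Milk"},
    8: {"Bacon", "Bread", "Egg", "Jam", "Milk"},
    9: {"Bread", "Jam", "Milk"},
    10: {"Battery", "Bacon", "Egg", "Milk"},
}
INFREQUENT = {"Battery", "Corn"}
# Assumption: MFIs at min_sup = 50%, derived by hand.
MFIS = [{"Bread", "Jam", "Milk"}, {"Bacon", "Egg", "Milk"}]

def od_upper_bound(t, mfis, infrequent):
    """Upper bound of the outlier degree via the maximal associative closure."""
    t_minus = set(t) - infrequent
    closure = set(t_minus)
    changed = True
    while changed:
        changed = False
        for m in mfis:
            if m & closure and not m <= closure:
                closure |= m
                changed = True
    if not closure:  # transaction made up of infrequent items only
        return Fraction(0)
    return Fraction(len(closure - t_minus), len(closure))

def prune(db, mfis, infrequent, min_od):
    """Keep only the TIDs whose upper bound reaches min_od (the set D_min)."""
    return [tid for tid, t in db.items()
            if od_upper_bound(t, mfis, infrequent) >= min_od]

candidates = prune(TABLE5, MFIS, INFREQUENT, Fraction(3, 10))
```

With min_od = 0.3, TIDs 4, 5, 7 and 8 are pruned because their upper bounds are 1/5 or 0, so the rule-based outlier degree only has to be computed for the six remaining transactions.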
At first glance, it seems that the abnormal transactions themselves do not provide valuable information for knowledge mining and that the proposed algorithm offers no major improvement over the conventional methods. However, the major contribution of the presented work lies in finding the items that cause transactions to be abnormal. To our knowledge, there is no literature that has studied converting the outliers into useful knowledge. There could be thousands of reasons that cause transactions to be abnormal. Some reasons, like human errors, are not easy to predict, and trying to explore them is beyond the scope of this study. But infrequent items may cause abnormal behavior in some applications [13], and we should go one step further to identify which items cause a transaction to be abnormal. To address this problem, a method is proposed for analyzing the relationships between infrequent items and abnormal transactions and identifying infrequent items that often cause the absence of certain frequent items.

Association rules mining finds items that frequently occur together. However, the mechanisms for finding association rules can also be applied to finding infrequent items that cause specific frequent items to be discarded. Before the association rules mining algorithm can be applied to find infrequent items that cause transactions to be labeled as abnormal, each transaction is transformed into two parts, namely absent frequent itemsets and infrequent items.

Definition 12. Let $t$ be a transaction and $R_{min}$ be the set of minimal association rules. An absent frequent itemset (AF) is defined as follows:

$$AF(t) = \{e \mid e = X \cup Y \text{ and } X \subseteq t \text{ and } Y \not\subseteq t \text{ and } X \rightarrow Y \in R_{min}\}.$$

Table 6 lists the transformed transactional database from Table 5. There are no absent frequent itemsets and no infrequent items in TID 1 in Table 5. TID 2 has three absent frequent itemsets, {Milk, Bread*}, {Milk, Jam, Bread*} and {Bacon, Milk, Egg*}, where an asterisk denotes that the item is expected but does not actually occur. The infrequent items for the transaction are Battery and Corn. TID 4 has no absent frequent itemsets, but has one infrequent item, Corn. TID 10 has one absent frequent itemset, {Milk, Bread*}, and its infrequent item is Battery. In the transformed transaction set, each absent frequent itemset is viewed as an item, and the relationships between absent frequent itemsets and infrequent items can be found. The complete algorithm, including outlier detection and finding the relationship between infrequent items and outliers, is shown in Fig. 6.

Table 7 shows an example used to verify that the proposed method can find the relationship between an outlier and its infrequent items. The items in the synthetic transaction set are a, b, c, d, e, f, g, h and i.
Table 8 shows partial frequent itemsets, and the association rules derived from Table 7 include {c} → {d}, {d} → {c}, {c, f} → {d} and {d, f} → {c} if the minimum support is set to 50% and the minimum confidence is 80%. According to Table 8, the infrequent items are a, e, g, h, and i. If the minimum outlier degree is set to 0.5, then transactions 5, 8, and 14 are outliers.

Table 6. The transformed transactional database. Each transaction is divided into absent frequent itemsets and infrequent items.

TID   Items                                                        Note
1     ∅                                                            ∅ denotes no absent frequent itemset and no infrequent item.
2     Milk/Jam/Bread*, Bacon/Milk/Egg*, Milk/Bread*, Battery, Corn   The absent frequent itemset {Milk, Jam, Bread*} is viewed as an item, and is denoted as Milk/Jam/Bread*. Battery and Corn are infrequent items.
3     ∅
4     Corn                                                         Corn is an infrequent item.
5     Corn                                                         Corn is an infrequent item.
6     Corn                                                         Corn is an infrequent item.
7     ∅
8     ∅
9     ∅
10    Milk/Bread*, Battery

1. Get the outlier transactions set OT from the transactional database D.
2. Transform OT to OT_trans.
   Transform each t in OT to t_trans by dividing t into two parts, absent frequent itemsets and infrequent items.
3. Get the association rules set R_trans from OT_trans.

Fig. 6. Algorithm for finding abnormal transactions and identifying which items cause a transaction to be labelled as abnormal.

The first step involves transforming each outlier into two parts, namely absent frequent itemsets and infrequent items. TID 5 has two absent frequent itemsets, {d, c*} and {d, f, c*}, and e and h are infrequent items. TID 8 has two absent frequent itemsets, namely {c, d*} and {c, f, d*}, and e and i are infrequent items. The final transformation result is shown in Table 9. Table 9 can be viewed as a new transactional database in which each absent frequent itemset is treated as an item. By applying association rules mining with minimum support and minimum confidence set to 50% and 80%, respectively, we find that item h is the one that induces the abnormal transactions, since the rules {h} → {d, f, c*} and {h} → {d, c*} can be derived from Table 9. This means that item c should appear, but because of the infrequent item h, item c is not observed in the transaction. That is, item h causes the transaction to be marked as abnormal.

Table 7. A partial synthetic transactional database.

TID   Items
1     c, d, f, g
2     a, b, c, d, e, g
3     a, c, d, f
4     c, d, h, i
5     d, e, f, h
6     a, c, d, f, e, g
7     b, c, d, e, f
8     b, c, f, e, i
9     c, d, e, f, g, i
10    b, c, d, f
11    a, b, c, d
12    b, g
13    c, d, f, h
14    b, d, f, h
15    b, c, d, f
16    c, d, f, g

Table 8. Part of frequent itemsets and infrequent items derived from Table 7.

1-item frequent itemsets   2-item frequent itemsets     3-item frequent itemsets   infrequent items
{b} {c} {d} {f}            {f, c} {f, d} {c, d}         {f, c, d}                  a, e, g, h, i

Table 9. The transformed transactional database according to Table 8.

TID   Items
5     d/c*, d/f/c*, e, h
8     c/d*, c/f/d*, e, i
14    d/c*, d/f/c*, h
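The transformation of Fig. 6 and the final mining step can be sketched for the three outliers of this example. The code below is a simplified illustration, not the paper's implementation: it applies the four rules quoted from Table 7 (all with single-item consequents), takes TIDs 5, 8 and 14 as given outliers, and replaces a full association rules miner with a direct search for rules of the form {infrequent item} → {absent frequent itemset}. All names are hypothetical.

```python
from fractions import Fraction

# Rules quoted in the text for Table 7, as (antecedent, consequent).
RULES = [({"c"}, {"d"}), ({"d"}, {"c"}), ({"c", "f"}, {"d"}), ({"d", "f"}, {"c"})]
INFREQUENT = set("aeghi")
OUTLIERS = {5: set("defh"), 8: set("bcfei"), 14: set("bdfh")}

def label(x, y):
    """Render an absent frequent itemset like 'd/f/c*' (starred items are absent)."""
    return "/".join(sorted(x)) + "/" + "/".join(i + "*" for i in sorted(y))

def transform(t):
    """Split a transaction into absent frequent itemsets and infrequent items."""
    absent = {label(x, y) for x, y in RULES if x <= t and not y <= t}
    return absent | (t & INFREQUENT)

def inducing_rules(transformed, min_sup, min_conf):
    """Find pairs (infrequent item, absent itemset) meeting support and confidence."""
    n = len(transformed)
    infreq = sorted({i for t in transformed for i in t if i in INFREQUENT})
    absent = sorted({i for t in transformed for i in t if i.endswith("*")})
    found = []
    for i in infreq:
        with_i = sum(1 for t in transformed if i in t)
        for a in absent:
            both = sum(1 for t in transformed if i in t and a in t)
            if Fraction(both, n) >= min_sup and Fraction(both, with_i) >= min_conf:
                found.append((i, a))
    return found

table9 = {tid: transform(t) for tid, t in OUTLIERS.items()}
rules_found = inducing_rules(list(table9.values()), Fraction(1, 2), Fraction(4, 5))
```

The transformed transactions reproduce Table 9, and the only pairs that reach 50% support and 80% confidence are h with d/c* and h with d/f/c*, matching the rules reported in the text.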

5 Experimental results and discussion

Three experiments were conducted to evaluate the effectiveness of the algorithm. The proposed algorithm was implemented in Dev C++ and experiments were run on a workstation with an Intel 2.5GHz processor and 2G of memory. FP-growth is adopted to mine frequent itemsets and association rules. The maximal frequent itemsets are derived by using FPmax.

The first experiment uses a synthetic data set generated with the IBM Quest synthetic data generator. The parameter settings for the data generation are: (i) the total number of transactions |D| = 532, (ii) the average size per transaction |t| = 8, and (iii) the total number of items N = 25. To get association rules from the generated database of 532 transactions, minimum support and minimum confidence were set to 18% and 78%, respectively. Table 10 lists the discovered association rules and infrequent items. Before getting outliers, the redundant association rules check was performed, and no redundant rules were found in this experiment. If the minimum outlier degree is set to 10%, 96 outliers are detected by employing the conventional algorithm [26], compared to 106 outliers by employing the approach proposed herein. Table 11 shows the number of outliers detected with different minimum outlier degree settings. The proposed algorithm can prune more transactions and find more outliers than the conventional methods. It is important to find every possible outlier, since each may reveal what induces transactions to be abnormal.

Next, the 106 outliers were taken as the testing set to find which items cause the outliers to be marked as abnormal. First, each outlier was transformed into absent frequent itemsets and infrequent items, and then association rules mining was applied to the transformed set. Table 12 shows a partial transformed result. In order to discover the items that cause the transactions to be marked as abnormal, several settings were explored, and it was found that with minimum support and minimum confidence set to 5% and 50%, respectively, the rule {j} → {f, i, m*} is found. This means that the infrequent item j causes transactions to be marked as abnormal.

Table 10. Association rules derived from the 532 transactions generated by the data generator.
Association rules      Infrequent items
{d, i} → {m}           h, j, l, n, o, q, t, u, x
{d, v} → {m}
{f, b} → {m}
{f, i} → {m}
{b, c} → {m}
{b, v} → {m}
{b, i, v} → {m}
{b, i, m} → {v}

Table 11. Outliers discovered from the transactional database.

min. outlier degree   No. of transactions pruned      No. of outliers discovered
                      previous [26]    ours           previous [26]    ours
10%
%

Table 12. Partial transformed data set for the 106 outliers.

TID   Items
9     b/i/m/v*, j, u
26    d/v/m*, j
39    b/i/m/v*
89    d/i/m*, d/v/m*, b/v/m*, b/i/v/m*, t

The second experiment uses the Wisconsin breast cancer data set from the UCI Machine Learning Repository [37]. To assess effectiveness, the accuracy and precision rates are defined as follows:

$$\text{accuracy} = \frac{\text{no. of detected outliers that are positive}}{\text{no. of all outliers}}. \quad (7)$$

$$\text{precision} = \frac{\text{no. of detected outliers that are positive}}{\text{no. of detected outliers}}. \quad (8)$$

The original Wisconsin breast cancer data set contains 699 records, with 458 labeled as benign and 241 labeled as malignant. Each record has 9 attributes and one class attribute. The attribute information is shown in Table 13. Among the 699 records, 14 benign records and 2 malignant records containing unknown data are discarded. To form an unbalanced data set, the experiment follows the strategy outlined in [12], namely removing another 200 malignant records. The final test data contains 444 benign records and 39 malignant records. We assume the 39 malignant records are true abnormal records. We also assume that some attributes may cause certain records to be abnormal.

In order to derive relationships between attributes, each record is transformed into a transaction with multiple categorical attributes, and then the association rules algorithm can be applied to this transaction set to get association rules. For example, if the first attribute value is 5, it will be labeled as a5. If the second attribute value is 1, it will be labeled as b1 (see Table 14). A transaction with class attribute o2 is a benign record, whereas o4 is a malignant record.

The third step involves mining association rules from the test data. Since the goal is to detect malignant records that are regarded as outliers, only rules with consequent part o2 are kept. Table 15 shows the association rules discovered from the transformed data set with minimum support and confidence set to 75% and 85%, respectively. Note that according to Definition 6 all k-item rules with k greater than 2 are redundant and are not listed in Table 15. The records with the top-k highest outlier degrees are chosen as outliers; that is, an outlier is not decided by comparing its outlier degree with the pre-defined minimum outlier degree. Table 16 lists the numbers of true outliers detected in the top-10, top-20, top-40, and top-60 results, with the corresponding accuracy and precision rates. According to Table 16, the proposed algorithm yields better accuracy and precision rates than the previous approach.

The last step of the second experiment involves finding infrequent items that cause outliers to be marked as abnormal. The experiment tries to find such items in the top-40 result. Again, each outlier in the top-40 result is transformed into absent frequent itemsets and infrequent items. The minimum support and minimum confidence are set to 50% and 80%, respectively. Only the rule {fa} → {b1, o2*} is found.
The rule {fa} -> {b1, o2*} suggests that the item fa (the attribute Bare Nuclei is 10) may be the reason that a patient's tumor is malignant, even though the attribute Uniformity of Cell Size is 1.

The third experiment uses the Parkinson's telemonitoring data set from the UCI Machine Learning Repository [37]. There are a total of 5,875 records in the data set, and each record has 19 attributes capturing 16 voice measures, gender, motor-updrs score, and total-updrs score [31]. Each attribute is quantitative and needs to be discretized, that is, divided into several intervals, before the association rules mining algorithm is applied. Each attribute, except gender, is divided into three non-overlapping intervals: high, medium, and low. For example, age ranges from 36 to 85; a subject older than 74 belongs to the high interval, a subject younger than 50 belongs to the low interval, and all others belong to the medium interval.

After discretizing the attributes, one can proceed to discover the association rules from the Parkinson's data set. In this experiment, a record with motor-updrs_medium is assumed to be a normal record, while a record with motor-updrs_high is treated as a possible outlier record. As in the second experiment, only rules with consequent part motor-updrs_medium are kept. Table 17 shows some of the association rules discovered from the transformed data set with minimum support and confidence of 8% and 75%, respectively. No k-item rule with k less than 4 has the consequent part motor-updrs_medium, and according to Definition 6, all k-item rules with k greater than 4 that have the consequent part motor-updrs_medium are redundant. Table 18 lists the discovered infrequent items.

The second step involves finding outliers by comparing the Parkinson's data set with the discovered rules. Several possible outlier records are found, and if the minimum outlier degree is set to 0.05, the 4 records in Table 19 are true outliers. According to the discovered association rules, these 4 records should have the attribute value motor-updrs_medium, but they have motor-updrs_high. To find the infrequent items that cause the records to become abnormal, each outlier in Table 19 is transformed into two parts: absent frequent itemsets and infrequent items.
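
The two-part transformation can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that an absent frequent itemset is a violated rule's antecedent together with its missing consequent (marked with `*`, which is how Table 12 appears to denote them), it uses only the rules listed in Table 17 and the infrequent items listed in Table 18, and it abbreviates the records of Table 19 to the attributes that appear in those rules or take an infrequent value. Instead of mining rules between the two parts, it simply counts how often each infrequent item occurs among the four outliers. All function and variable names are hypothetical.

```python
# Illustrative sketch: split outliers into the two parts described above and
# count how often each infrequent item occurs among them.
# Assumption: an "absent frequent itemset" is a rule's antecedent plus its
# missing consequent, marked with "*" as in Table 12.

INFREQUENT_ITEMS = {  # Table 18
    "age_low", "jitter_high", "Abs_high", "RAP_high", "PPQ5_high", "DDP_high",
    "shimmer_high", "db_high", "APQ3_high", "APQ5_high", "APQ11_high",
    "DDA_high", "NHR_high", "HNR_low", "HNR_high", "RPDE_low", "RPDE_high",
    "DFA_high", "PPE_high",
}

CONSEQUENT = "motor-updrs_medium"
RULE_ANTECEDENTS = [  # Table 17
    ("Male", "shimmer_low", "db_low"),
    ("Male", "shimmer_low", "APQ3_low"),
    ("Male", "age_medium", "db_low"),
    ("Male", "db_low", "RPDE_medium"),
    ("Male", "APQ3_low", "DFA_medium"),
]

# Records 1-4 of Table 19, abbreviated to the attributes that appear in the
# rules above or take an infrequent value in at least one record.
COMMON = ["Male", "motor-updrs_high", "shimmer_low", "db_low", "APQ3_low"]
OUTLIERS = [
    COMMON + ["age_low", "HNR_medium", "RPDE_medium", "DFA_medium"],
    COMMON + ["age_medium", "HNR_high", "RPDE_medium", "DFA_low"],
    COMMON + ["age_medium", "HNR_medium", "RPDE_low", "DFA_low"],
    COMMON + ["age_medium", "HNR_medium", "RPDE_low", "DFA_low"],
]


def split_outlier(record):
    """Return (absent frequent itemsets, infrequent items) for one outlier."""
    items = set(record)
    absent = ["/".join(antecedent) + "/" + CONSEQUENT + "*"
              for antecedent in RULE_ANTECEDENTS
              if items.issuperset(antecedent) and CONSEQUENT not in items]
    infrequent = sorted(items & INFREQUENT_ITEMS)
    return absent, infrequent


def infrequent_support(records):
    """Fraction of outliers that contain each infrequent item."""
    counts = {}
    for record in records:
        for item in split_outlier(record)[1]:
            counts[item] = counts.get(item, 0) + 1
    return {item: count / len(records) for item, count in counts.items()}


support = infrequent_support(OUTLIERS)
candidates = sorted(item for item, s in support.items() if s >= 0.5)
print(candidates)  # ['RPDE_low']
```

Under these assumptions, age_low and HNR_high each occur in one of the four outliers (support 25%), while RPDE_low occurs in two (support 50%), so only RPDE_low can reach a 50% minimum support.
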
Again, by applying the association rules mining algorithm with a minimum support of 50% and a minimum confidence of 50%, the infrequent item RPDE_low (a nonlinear dynamical complexity measure below 0.347) is identified as the source that causes the records to become abnormal. That is, when all the measurements are in the medium or low intervals, the RPDE measure is the key to telling a healthy subject apart from a Parkinson's patient.

These experiments show that the proposed outlier detection method is more practical than previous approaches, since it not only identifies the outlier transactions but also discovers the associations between an outlier and its infrequent items. This is useful because the mining results help users determine what causes transactions to become abnormal without having to consult expert knowledge in advance.

Table 13. The attribute information for Wisconsin breast cancer data set.

attribute ID | attribute information | domain
1 | Clump Thickness | 1-10
2 | Uniformity of Cell Size | 1-10
3 | Uniformity of Cell Shape | 1-10
4 | Marginal Adhesion | 1-10
5 | Single Epithelial Cell Size | 1-10
6 | Bare Nuclei | 1-10
7 | Bland Chromatin | 1-10
8 | Normal Nucleoli | 1-10
9 | Mitoses | 1-10
10 | Class | 2 for benign, 4 for malignant

Table 14. Partial transformed data set in transaction format.

Original instances (9 categorical attributes in each record) | Transformed result
<5, 1, 1, 1, 2, 1, 3, 1, 1, 2> | <a5, b1, c1, d1, e2, f1, g3, h1, i1, o2>
<5, 4, 4, 5, 7, 10, 3, 2, 1, 2> | <a5, b4, c4, d5, e7, fa, g3, h2, i1, o2>
<3, 1, 1, 1, 2, 2, 3, 1, 1, 2> | <a3, b1, c1, d1, e2, f2, g3, h1, i1, o2>
<9, 1, 2, 6, 4, 10, 7, 7, 2, 4> | <a9, b1, c2, d6, e4, fa, g7, h7, i2, o4>

Table 15. Association rules mined from Wisconsin breast cancer data set.

2-item rules | (support, confidence)
{b1} -> {o2} | (76.4%, 100%)
{f1} -> {o2} | (80.1%, 98.7%)
{h1} -> {o2} | (81.0%, 98.5%)
{i1} -> {o2} | (89.2%, 95.1%)

Table 16. Outlier detection under different k values.

Top-k | previous [26]: true outliers (accuracy, precision) | our: true outliers (accuracy, precision)
top-10 | 6 (15%, 60%) | 8 (21%, 80%)
top-20 | 15 (38%, 75%) | 17 (44%, 85%)
top-40 | 34 (87%, 85%) | 35 (90%, 88%)
top-60 | 39 (100%, 65%) | 39 (100%, 65%)

Table 17. Partial association rules discovered from Parkinson's data set (Min Support=8%, Min Confidence=75%).

Antecedent part for discovered rule | Consequent part for discovered rule
Male, shimmer_low, db_low | motor-updrs_medium
Male, shimmer_low, APQ3_low | motor-updrs_medium
Male, age_medium, db_low | motor-updrs_medium
Male, db_low, RPDE_medium | motor-updrs_medium
Male, APQ3_low, DFA_medium | motor-updrs_medium

Table 18. Infrequent items discovered from Parkinson's data set (Min Support=8%, Min Confidence=75%).

age_low, jitter_high, Abs_high, RAP_high, PPQ5_high, DDP_high, shimmer_high, db_high, APQ3_high, APQ5_high, APQ11_high, DDA_high, NHR_high, HNR_low, HNR_high, RPDE_low, RPDE_high, DFA_high, PPE_high
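
As a quick consistency check, the rates reported for the proposed method in Table 16 can be recomputed from equations (7) and (8). The sketch below is illustrative: the function names are hypothetical, the counts are the "our" entries of Table 16, and 39 is the number of malignant records kept in the test data. Note that the quantity called accuracy in equation (7) is what is more commonly called recall.

```python
# Recompute the "our" column of Table 16 from equations (7) and (8).
TRUE_OUTLIERS = 39  # malignant records kept in the test data

# k -> number of true outliers among the top-k records (Table 16, "our")
DETECTED_POSITIVE = {10: 8, 20: 17, 40: 35, 60: 39}


def accuracy(positive_detected, all_outliers):
    """Equation (7): share of all true outliers that were detected."""
    return positive_detected / all_outliers


def precision(positive_detected, detected):
    """Equation (8): share of detected records that are true outliers."""
    return positive_detected / detected


def as_percent(rate):
    """Round a rate in [0, 1] to a whole percentage."""
    return round(100 * rate)


for k, hits in DETECTED_POSITIVE.items():
    print(f"top-{k}: accuracy {as_percent(accuracy(hits, TRUE_OUTLIERS))}%, "
          f"precision {as_percent(precision(hits, k))}%")
```

The printed percentages match the (accuracy, precision) pairs listed for the proposed method, and they show the usual trade-off: as k grows, accuracy rises toward 100% while precision eventually falls, because the top-60 list must contain at least 21 benign records.
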

Table 19. The 4 outliers discovered by comparing the Parkinson's data set with discovered association rules.

No. | Outlier records | Outlier degrees
1 | age_low, Male, motor-updrs_high, jitter_low, Abs_medium, RAP_low, PPQ5_low, DDP_low, shimmer_low, db_low, APQ3_low, APQ5_low, APQ11_low, DDA_low, NHR_low, HNR_medium, RPDE_medium, DFA_medium, PPE_medium | 1/19
2 | age_medium, Male, motor-updrs_high, jitter_low, Abs_low, RAP_low, PPQ5_low, DDP_low, shimmer_low, db_low, APQ3_low, APQ5_low, APQ11_low, DDA_low, NHR_low, HNR_high, RPDE_medium, DFA_low, PPE_low | 1/19
3 | age_medium, Male, motor-updrs_high, jitter_low, Abs_low, RAP_low, PPQ5_low, DDP_low, shimmer_low, db_low, APQ3_low, APQ5_low, APQ11_low, DDA_low, NHR_low, HNR_medium, RPDE_low, DFA_low, PPE_medium | 1/19
4 | age_medium, Male, motor-updrs_high, jitter_low, Abs_low, RAP_low, PPQ5_low, DDP_low, shimmer_low, db_low, APQ3_low, APQ5_low, APQ11_low, DDA_low, NHR_low, HNR_medium, RPDE_low, DFA_low, PPE_medium | 1/19

6 Conclusions

From the perspective of outlier management, conventional methods did not address how to make further use of the detected outliers. The proposed framework can find the infrequent items that induce transactions to be abnormal. To prevent infrequent items from distorting the outlier degrees, the proposed method modifies the definition of a transaction's association closure by removing the infrequent items before the outlier degrees are calculated. After identifying the outliers, the proposed approach further discovers which infrequent items make transactions abnormal. Abnormal transactions are divided into absent frequent itemsets and infrequent items. By applying an association rule mining method, the relationships between absent frequent itemsets and infrequent items are found. The items that cause the transactions to become outliers are thereby identified, and the mining results are easier to understand. The proposed framework provides a total solution not only for finding but also for managing outliers. The experimental results verify that the proposed algorithm is more effective in terms of both accuracy and precision rates.

Future improvements are possible. The calculation of the outlier degree relies on associative closure.
However, the confidence values of the association rules should also be considered. That is, if a transaction violates a rule with higher confidence, it should have a higher outlier degree. Next, the antecedent parts of association rules affect the outlier detection. Since the proposed algorithm employs non-redundant rules to check transactions, the final result may include many known outliers, even though the infrequent items that cause the abnormal outliers are revealed. In this case, setting a minimum number of items for the antecedent part may solve the problem.

We first applied the framework to health care data to verify the algorithm's feasibility. In the future, it is necessary to acquire more real-world data from different sources to derive abnormal transactions and find the reasons behind the abnormality. The mining results will also be shared with hospital officials to solicit their opinions. It is important to mention that the proposed framework can also be applied to any kind of transaction data set to find which infrequent items induce transactions to be abnormal. There are a variety of reasons that can lead to abnormality, and this study's contribution is to provide a way to identify the sources of confusion. Infrequent items are usually ignored in data mining, but they may provide valuable information that allows people to make better decisions.

Acknowledgments

This work was supported in part by Ministry of Science and Technology, Taiwan under Grants NSC E and NSC E MY2, and in part by joint project between National Taipei University of Technology and Mackay Memorial Hospital under Grant NTUT-MMH and Grant NTUT-MMH

References

[1] Angiulli F, Pizzuti C (2002) Fast outlier detection in high dimensional spaces. In: Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery in Databases, Helsinki, Finland.
[2] Angiulli F, Pizzuti C (2005) Outlier mining in large high-dimensional data sets. IEEE Trans on Knowledge and Data Engineering 17.
[3] Bahrampour S, Moshiri B, Salahshoor K (2011) Weighted and constrained possibilistic C-means clustering for online fault detection and isolation. Applied Intelligence 35(2).
[4] Bhaduri K, Matthews BL, Giannella CR (2011) Algorithms for speeding up distance-based outlier detection. In: Proceedings of ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, San Diego, CA, USA.
[5] Burdick D, Calimlim M, Flannick J, Gehrke J, Yiu T (2005) MAFIA: A maximal frequent itemset algorithm. IEEE Trans on Knowledge and Data Engineering 17.
[6] Chandola V, Banerjee A, Kumar V (2009) Anomaly detection: A survey. ACM Computing Surveys 41:1-58.
[7] Chazard E, Ficheur G, Bernonville S, Luyckx M, Beuscart R (2011) Data mining to generate adverse drug events detection rules. IEEE Trans on Information Technology in Biomedicine 15.
[8] Elahi M, Li K, Nisar W, Lv X, Wang H (2008) Efficient clustering-based outlier detection algorithm for dynamic data stream. In: Proceedings of the 5th Int. Conf. on Fuzzy Systems and Knowledge Discovery, Jinan, Shandong, China 5.
[9] Gouda K, Zaki MJ (2001) Efficiently mining maximal frequent itemsets. In: Proceedings of IEEE Int. Conf. on Data Mining, San Jose, California, USA.
[10] Grahne G, Zhu J (2003) High performance mining of maximal frequent itemsets. In: Proceedings of the 6th SIAM Workshop on High Performance Data Mining, San Francisco, CA, USA.
[11] Grahne G, Zhu JF (2005) Fast algorithms for frequent item set mining using FP-Trees. IEEE Trans on Knowledge and Data Engineering 17.
[12] Guo T, Li GY (2008) Neural data mining for credit card fraud detection. In: Proceedings of the 7th Int. Conf. on Machine Learning and Cybernetics, Kunming, China 7.
[13] Haglin DJ, Manning AM (2007) On minimal infrequent itemset mining. In: Proceedings of the Int. Conf. on Data Mining, Las Vegas, Nevada, USA.
[14] Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. In: Proceedings of ACM SIGMOD Int. Conf. on Management of Data, Dallas, Texas, USA.
[15] He Z, Deng S, Xu X (2005) An optimization model for outlier detection in categorical data. In: Proceedings of IEEE Int. Conf. on Intelligent Computing, Hefei, China.
[16] He Z, Deng S, Xu X (2006) A fast greedy algorithm for outlier mining. In: Proceedings of the 10th Pacific-Asia Conf. on Knowledge Discovery and Data Mining, Singapore.
[17] He Z, Xu X, Deng S (2005) Fp-outlier: Frequent pattern based outlier detection. Computer Science and Information System 2.
[18] Hido S, Tsuboi Y, Kashima H, Sugiyama M, Kanamori T (2011) Statistical outlier detection using direct density ratio estimation. Knowledge and Information Systems 26.
[19] Hu T, Sung SY, Xiong H, Fu Q (2008) Discovery of maximum length frequent itemsets. Information Sciences 178.
[20] Huang Y-P, Kao LJ, Sandnes FE (2008) Efficient mining of salinity and temperature association rules from ARGO data. Expert Systems with Applications 35.
[21] Koufakou A, Georgiopoulos M, Anagnostopoulos GC, Reynolds KM (2007) A scalable and efficient outlier detection strategy for categorical data. In: Proceedings of IEEE Int. Conf. on Tools with Artificial Intelligence, Patras, Greece.
[22] Koufakou A, Georgiopoulos M (2010) A fast outlier detection strategy for distributed high-dimensional data sets with mixed attributes. Data Mining and Knowledge Discovery 20.
[23] Kriegel HP, Kröger P, Zimek A (2009) Clustering high-dimensional data: A survey on subspace clustering, pattern-based clustering, and correlation clustering. ACM Transactions on Knowledge Discovery from Data 3:1-58.
[24] Lei D, Zhu QH, Chen J, Lin H, Yang P (2012) Automatic PAM clustering algorithm for outlier detection. Journal of Software 7.
[25] Márquez-Vera C, Morales CR, Soto SV (2013) Predicting school failure and dropout by using data mining techniques. IEEE Journal of Latin-American Learning Technologies 8:7-14.
[26] Narita K, Kitagawa H (2008) Outlier detection for transaction databases using association rules. In: Proceedings of the 9th Int. Conf. on Web-Age Information Management, Zhangjiajie, Hunan, China.
[27] Otey ME, Ghoting A, Parthasarathy A (2006) Fast distributed outlier detection in mixed-attribute data sets. Data Mining and Knowledge Discovery 12.
[28] Papadimitriou S, Kitagawa H, Gibbons PB, Faloutsos C (2003) Loci: Fast outlier detection using the local correlation integral. In: Proceedings of the 19th Int. Conf. on Data Engineering, Bangalore, India.
[29] Shi K, Li L (2013) High performance genetic algorithm based text clustering using parts of speech and outlier elimination. Applied Intelligence 38(4).
[30] Troiano L, Scibelli G (2014) Mining frequent itemsets in data streams within a time horizon. Data & Knowledge Engineering 89.
[31] Tsanas A, Little MA, McSharry PE, Ramig LO (2010) Accurate telemonitoring of Parkinson's disease progression by non-invasive speech tests. IEEE Transactions on Biomedical Engineering 57.
[32] Tseng VS, Shie B-E, Wu C-W, Yu PS (2013) Efficient algorithms for mining high utility itemsets from transactional databases. IEEE Trans on Knowledge and Data Engineering 25.
[33] Wu X, Kumar V, Ross Quinlan J, Ghosh J, Yang Q, Motoda H, McLachlan G, Ng A, Liu B, Yu P, Zhou Z-H, Steinbach M, Hand D, Steinberg D (2008) Top 10 algorithms in data mining. Knowledge and Information Systems 14:1-37.
[34] Wan Y, Bian F (2008) Cell-based outlier detection algorithm: A fast outlier detection algorithm for large datasets. In: Proceedings of the 12th Pacific-Asia Conference on Knowledge Discovery and Data Mining, Osaka, Japan 5012.
[35] Yanqing J, Hao Y, Peter D, Ayman M, John T, Richard ME, Massanari R-M (2011) A potential causal association mining algorithm for screening adverse drug reactions in postmarketing surveillance. IEEE Trans on Information Technology in Biomedicine 15.
[36] Zhu C, Kitagawa H, Faloutsos C (2005) Example-based robust outlier detection in high dimensional datasets. In: Proceedings of the 5th IEEE Int. Conf. on Data Mining, Houston, Texas, USA.
[37] UCI machine learning repository. http:// mlearn/mlrepository.html.


More information

Using Fuzzy-Delphi Technique to Determine the Concession Period in BOT Projects

Using Fuzzy-Delphi Technique to Determine the Concession Period in BOT Projects Usng Fuzzy-Delph Technque o Deermne he Concesson Perod n BOT Projecs Khanzad Mosafa Iran Unversy of Scence and Technology School of cvl engneerng Tehran, Iran. P.O. Box: 6765-63 khanzad@us.ac.r Nasrzadeh

More information

A Novel Application of the Copula Function to Correlation Analysis of Hushen300 Stock Index Futures and HS300 Stock Index

A Novel Application of the Copula Function to Correlation Analysis of Hushen300 Stock Index Futures and HS300 Stock Index A Novel Applcaon of he Copula Funcon o Correlaon Analyss of Hushen3 Sock Index Fuures and HS3 Sock Index Fang WU *, 2, Yu WEI. School of Economcs and Managemen, Souhwes Jaoong Unversy, Chengdu 63, Chna

More information

Fairing of Polygon Meshes Via Bayesian Discriminant Analysis

Fairing of Polygon Meshes Via Bayesian Discriminant Analysis Farng of Polygon Meshes Va Bayesan Dscrmnan Analyss Chun-Yen Chen Insue of Informaon Scence, Academa Snca. Deparmen of Compuer Scence and Informaon Engneerng, Naonal Tawan Unversy. 5, Tawan, Tape, Nankang

More information

Quarterly Accounting Earnings Forecasting: A Grey Group Model Approach

Quarterly Accounting Earnings Forecasting: A Grey Group Model Approach Quarerly Accounng Earnngs Forecasng: A Grey Group Model Approach Zheng-Ln Chen Deparmen of Accounng Zhongnan Unversy of Economcs and Law # Souh Nanhu Road, Wuhan Cy, 430073 Hube People's Republc of Chna

More information

Agricultural and Rural Finance Markets in Transition

Agricultural and Rural Finance Markets in Transition Agrculural and Rural Fnance Markes n Transon Proceedngs of Regonal Research Commee NC-04 S. Lous, Mssour Ocober 4-5, 007 Dr. Mchael A. Gunderson, Edor January 008 Food and Resource Economcs Unversy of

More information

American basket and spread options. with a simple binomial tree

American basket and spread options. with a simple binomial tree Amercan baske and spread opons wh a smple bnomal ree Svelana orovkova Vre Unverse Amserdam Jon work wh Ferry Permana acheler congress, Torono, June 22-26, 2010 1 Movaon Commody, currency baskes conss of

More information

Cryptographic techniques used to provide integrity of digital content in long-term storage

Cryptographic techniques used to provide integrity of digital content in long-term storage RB/3/2011 Crypographc echnques used o provde negry of dgal conen n long-erm sorage REPORT ON THE PROBLEM Problem presened by Marn Šmka Paweł Wojcechowsk Polsh Secury Prnng Works (PWPW) 1 Repor auhors Małgorzaa

More information

Data Mining Anomaly Detection. Lecture Notes for Chapter 10. Introduction to Data Mining

Data Mining Anomaly Detection. Lecture Notes for Chapter 10. Introduction to Data Mining Daa Mining Anomaly Deecion Lecure Noes for Chaper 10 Inroducion o Daa Mining by Tan, Seinbach, Kumar Tan,Seinbach, Kumar Inroducion o Daa Mining 4/18/2004 1 Anomaly/Oulier Deecion Wha are anomalies/ouliers?

More information

Estimating intrinsic currency values

Estimating intrinsic currency values Esmang nrnsc currency values Forex marke praconers consanly alk abou he srenghenng or weakenng of ndvdual currences. In hs arcle, Jan Chen and Paul Dous presen a new mehodology o quanfy hese saemens n

More information

Data Mining Anomaly Detection. Lecture Notes for Chapter 10. Introduction to Data Mining

Data Mining Anomaly Detection. Lecture Notes for Chapter 10. Introduction to Data Mining Daa Mining Anomaly Deecion Lecure Noes for Chaper 10 Inroducion o Daa Mining by Tan, Seinbach, Kumar Tan,Seinbach, Kumar Inroducion o Daa Mining 4/18/2004 1 Anomaly/Oulier Deecion Wha are anomalies/ouliers?

More information

UC San Diego Recent Work

UC San Diego Recent Work UC San Dego Recen Work Tle On More Robus Esmaon of Skewness and Kuross: Smulaon and Applcaon o he S&P500 Index Permalnk hps://escholarshp.org/uc/em/7b5v07p Auhors Km, Tae-Hwan Whe, Halber Publcaon Dae

More information

Optimal Combination of Trading Rules Using Neural Networks

Optimal Combination of Trading Rules Using Neural Networks Vol. 2, No. Inernaonal Busness Research Opmal Combnaon of Tradng Rules Usng Neural Neworks Subraa Kumar Mra Professor, Insue of Managemen Technology 35 Km Mlesone, Kaol Road Nagpur 44 502, Inda Tel: 9-72-280-5000

More information

A New Method to Measure the Performance of Leveraged Exchange-Traded Funds

A New Method to Measure the Performance of Leveraged Exchange-Traded Funds A ew Mehod o Measure he Performance of Leveraged Exchange-Traded Funds Ths verson: Sepember 03 ara Charupa DeGrooe School of Busness McMaser Unversy 80 Man Sree Wes Hamlon, Onaro L8S 4M4 Canada Tel: (905)

More information

Analysing Big Data to Build Knowledge Based System for Early Detection of Ovarian Cancer

Analysing Big Data to Build Knowledge Based System for Early Detection of Ovarian Cancer Indan Journal of Scence and Technology, Vol 8(4), DOI: 0.7485/js/205/v84/65745, July 205 ISSN (Prn) : 0974-6846 ISSN (Onlne) : 0974-5645 Analysng Bg Daa o Buld Knowledge Based Sysem for Early Deecon of

More information

Hardware-Assisted High-Efficiency Ray Casting of Unstructured Time-Varying Flows Using Temporal Coherence

Hardware-Assisted High-Efficiency Ray Casting of Unstructured Time-Varying Flows Using Temporal Coherence Hardware-Asssed Hgh-Effcency Ray Casng of Unsrucured Tme-Varyng Flows Usng Temporal Coherence Qanl Ma, Lang Zeng, Huaxun Xu, Wenke Wang, Skun L Absrac Advances n compuaonal power are enablng hgh-precson

More information

The Virtual Machine Resource Allocation based on Service Features in Cloud Computing Environment

The Virtual Machine Resource Allocation based on Service Features in Cloud Computing Environment Send Orders for Reprns o reprns@benhamscence.ae The Open Cybernecs & Sysemcs Journal, 2015, 9, 639-647 639 Open Access The Vrual Machne Resource Allocaon based on Servce Feaures n Cloud Compung Envronmen

More information

THE IMPACT OF COMMODITY DERIVATIVES IN AGRICULTURAL FUTURES MARKETS

THE IMPACT OF COMMODITY DERIVATIVES IN AGRICULTURAL FUTURES MARKETS Alghero, 25-27 June 20 Feedng he Plane and Greenng Agrculure: Challenges and opporunes for he bo-econom THE IMPACT OF COMMODITY DERIVATIVES IN AGRICULTURAL FUTURES MARKETS Zupprol M., Dona M., Verga G.,

More information

Tax Dispute Resolution and Taxpayer Screening

Tax Dispute Resolution and Taxpayer Screening DISCUSSION PAPER March 2016 No. 73 Tax Dspue Resoluon and Taxpayer Screenng Hdek SATO* Faculy of Economcs, Kyushu Sangyo Unversy ----- *E-Mal: hsao@p.kyusan-u.ac.jp Tax Dspue Resoluon and Taxpayer Screenng

More information

Pricing Model of Credit Default Swap Based on Jump-Diffusion Process and Volatility with Markov Regime Shift

Pricing Model of Credit Default Swap Based on Jump-Diffusion Process and Volatility with Markov Regime Shift Assocaon for Informaon Sysems AIS Elecronc brary (AISe) WICEB 13 Proceedngs Wuhan Inernaonal Conference on e-busness Summer 5-5-13 Prcng Model of Cred Defaul Swap Based on Jump-Dffuson Process and Volaly

More information

ANFIS Based Time Series Prediction Method of Bank Cash Flow Optimized by Adaptive Population Activity PSO Algorithm

ANFIS Based Time Series Prediction Method of Bank Cash Flow Optimized by Adaptive Population Activity PSO Algorithm Informaon 25, 6, 3-33; do:.339/nfo633 Arcle OPEN ACCESS nformaon ISSN 278-2489 www.mdp.com/journal/nformaon ANFIS Based Tme Seres Predcon Mehod of Bank Cash Flow Opmzed by Adapve Populaon Acvy PSO Algorhm

More information

SETTING CUT OFF SCORES FOR SELECTIVE EDITING IN STRUCTURAL BUSINESS STATISTICS: AN AUTOMATIC PROCEDURE USING SIMULATION STUDY.

SETTING CUT OFF SCORES FOR SELECTIVE EDITING IN STRUCTURAL BUSINESS STATISTICS: AN AUTOMATIC PROCEDURE USING SIMULATION STUDY. WP.41 ENGLISH ONLY UNITED NATIONS STATISTICAL COMMISSION and ECONOMIC COMMISSION FOR EUROPE CONFERENCE OF EUROPEAN STATISTICIANS Wor Sesson on Sascal Daa Edng (Neuchâel, Swzerland, 5-7 Ocober 2009) Topc

More information

Multiagent System Simulations of Sealed-Bid Auctions with Two-Dimensional Value Signals

Multiagent System Simulations of Sealed-Bid Auctions with Two-Dimensional Value Signals Deparmen Dscusson Paper DDP77 ISSN 94-2838 Deparmen of Economcs Mulagen Sysem Smulaons of Sealed-Bd Aucons wh Two-Dmensonal Value Sgnals Alan Mehlenbacher Deparmen of Economcs, Unversy of Vcora Vcora,

More information

Impact of Stock Markets on Economic Growth: A Cross Country Analysis

Impact of Stock Markets on Economic Growth: A Cross Country Analysis Impac of Sock Markes on Economc Growh: A Cross Counry Analyss By Muhammad Jaml Imporance of sock markes for poolng fnancal resources ncreased snce he las wo decades. Presen sudy analyzed mpac of sock markes

More information

The Proposed Mathematical Models for Decision- Making and Forecasting on Euro-Yen in Foreign Exchange Market

The Proposed Mathematical Models for Decision- Making and Forecasting on Euro-Yen in Foreign Exchange Market Iranan Economc Revew, Vol.6, No.30, Fall 20 The Proposed Mahemacal Models for Decson- Makng and Forecasng on Euro-Yen n Foregn Exchange Marke Abdorrahman Haer Masoud Rabban Al Habbna Receved: 20/07/24

More information

Effective Feedback Of Whole-Life Data to The Design Process

Effective Feedback Of Whole-Life Data to The Design Process Effecve Feedback Of Whole-Lfe Daa o The Desgn Process Mohammed Kshk 1*, Assem Al-Hajj 1, Rober Pollock 1 and Ghassan Aouad 2 1 The Sco Suherland School, The Rober Gordon Unversy, Garhdee Road, Aberdeen

More information

Documentation: Philadelphia Fed's Real-Time Data Set for Macroeconomists First-, Second-, and Third-Release Values

Documentation: Philadelphia Fed's Real-Time Data Set for Macroeconomists First-, Second-, and Third-Release Values Documenaion: Philadelphia Fed's Real-Time Daa Se for Macroeconomiss Firs-, Second-, and Third-Release Values Las Updaed: December 16, 2013 1. Inroducion We documen our compuaional mehods for consrucing

More information

RMF: Rough Set Membership Function-based for Clustering Web Transactions

RMF: Rough Set Membership Function-based for Clustering Web Transactions Inernaonal Journal of Mulmeda and Ubquous Engneerng Vol.8, No.6 (0), pp.05-8 hp://dx.do.org/0.57/mue.0.8.6. RMF: Rough Se Membershp Funcon-based for luserng Web Transacons Tuu Herawan and Wan Maser Wan

More information

FINAL EXAM EC26102: MONEY, BANKING AND FINANCIAL MARKETS MAY 11, 2004

FINAL EXAM EC26102: MONEY, BANKING AND FINANCIAL MARKETS MAY 11, 2004 FINAL EXAM EC26102: MONEY, BANKING AND FINANCIAL MARKETS MAY 11, 2004 This exam has 50 quesions on 14 pages. Before you begin, please check o make sure ha your copy has all 50 quesions and all 14 pages.

More information

Numerical Evaluation of European Option on a Non Dividend Paying Stock

Numerical Evaluation of European Option on a Non Dividend Paying Stock Inernaonal Journal of Compuaonal cence and Mahemacs. IN 0974-389 olume Number 3 (00) pp. 6--66 Inernaonal Research Publcaon House hp://www.rphouse.com Numercal Evaluaon of European Opon on a Non Dvdend

More information

Some Insights of Value-Added Tax Gap

Some Insights of Value-Added Tax Gap Ovdus Unversy Annals, Economc Scences Seres Some Insghs of Value-Added Tax Ga Cuceu Ionuţ-Consann Vădean Vorela-Lga Maşca Smona-Gabrela "Babeş-Bolya" Unversy Cluj-Naoca, Faculy of Economcs and Busness

More information

Forecasting Sales: Models, Managers (Experts) and their Interactions

Forecasting Sales: Models, Managers (Experts) and their Interactions Forecasing Sales: Models, Managers (Expers) and heir Ineracions Philip Hans Franses Erasmus School of Economics franses@ese.eur.nl ISF 203, Seoul Ouline Key issues Durable producs SKU sales Opimal behavior

More information

Online Data, Fixed Effects and the Construction of High-Frequency Price Indexes

Online Data, Fixed Effects and the Construction of High-Frequency Price Indexes Onlne Daa, Fxed Effecs and he Consrucon of Hgh-Frequency Prce Indexes Jan de Haan* and Rens Hendrks** * ascs eherlands / Delf Unversy of Technology ** ascs eherlands EMG Worksho 23 Ams of he aer Exlan

More information

Empirical analysis on China money multiplier

Empirical analysis on China money multiplier Aug. 2009, Volume 8, No.8 (Serial No.74) Chinese Business Review, ISSN 1537-1506, USA Empirical analysis on China money muliplier SHANG Hua-juan (Financial School, Shanghai Universiy of Finance and Economics,

More information

A Hybrid Method for Forecasting with an Introduction of a Day of the Week Index to the Daily Shipping Data of Sanitary Materials

A Hybrid Method for Forecasting with an Introduction of a Day of the Week Index to the Daily Shipping Data of Sanitary Materials Journal of Communcaon and Compuer (05) 0-07 do: 0.765/548-7709/05.0.00 D DAVID PUBLISHING A Hyrd Mehod for Forecasng wh an Inroducon of a Day of he Week Inde o he Daly Shppng Daa of Sanary Maerals Dasuke

More information

Classification and Prediction. Topic 5: Data Mining II. Classification Process (1): Model Construction. Classification A Two-Step Process

Classification and Prediction. Topic 5: Data Mining II. Classification Process (1): Model Construction. Classification A Two-Step Process Topc : Daa Mnng II Te classfcaon model assumes a se of predefned classes and ams o classfy a large collecon of uples/samples o ese classes class X class Y luserng s else called unsupervsed classfcaon;

More information

Online Adaboost-Based Parameterized Methods for Dynamic Distributed Network Intrusion Detection

Online Adaboost-Based Parameterized Methods for Dynamic Distributed Network Intrusion Detection Onlne Adaboos-Based Parameerzed Mehods or Dnamc Dsrbued Nework Inruson Deecon Wemng Hu, Jun Gao, Yanguo Wang, and Ou Wu (Naonal Laboraor o Paern Recognon, Insue o Auomaon, Chnese Academ o Scences, Beng

More information

Determinants of firm exchange rate predictions:

Determinants of firm exchange rate predictions: CESSA WP 208-0 Deermnans of frm exchange rae predcons: Emprcal evdence from survey daa of Japanese frms Th-Ngoc Anh NGUYEN Yokohama Naonal Unversy Japan Socey for he Promoon of Scence May 208 Cener for

More information

Price trends and patterns in technical analysis: A theoretical and empirical examination

Price trends and patterns in technical analysis: A theoretical and empirical examination Prce rends and paerns n echncal analyss: A heorecal and emprcal examnaon Geoffrey C. Fresen a*, Paul A. Weller b, Lee M. Dunham c a Deparmen of Fnance, College of Busness, Unversy of Nebraska Lncoln, Lncoln,

More information

A Novel Particle Swarm Optimization Approach for Grid Job Scheduling

A Novel Particle Swarm Optimization Approach for Grid Job Scheduling A Novel Parcle warm Opmzaon Approach for Grd ob chedulng Hesam Izaan, Behrouz Tor Ladan, Kamran Zamanfar, Ajh Abraham³ Islamc Azad Unversy, Ramsar branch, Ramsar, Iran zaan@gmal.com Deparmen of Compuer

More information

Semantic-based Detection of Segment Outliers and Unusual Events for Wireless Sensor Networks (Research-in-Progress)

Semantic-based Detection of Segment Outliers and Unusual Events for Wireless Sensor Networks (Research-in-Progress) Semanc-based Deecon of Segmen ulers and Unusual Evens for Wreless Sensor Neworks (Research-n-Progress) Lanl Gao eresearch Lab, School of ITEE, The Unversy of Queensland, Brsbane, Queensland 4072, Ausrala

More information

The impact of intellectual capital on returns and stock prices of listed companies in Tehran Stock Exchange

The impact of intellectual capital on returns and stock prices of listed companies in Tehran Stock Exchange Appled Scence Repors www.pscpub.com/asr -SSN: 231-944 / P-SSN: 2311-139 DO: 1.15192/PSCP.ASR.214.4.3.1516 App. Sc. Repor. 4 (3), 214: 15-16 PSC Publcaons The mpac of nellecual capal on reurns and sock

More information

A Neural Network Approach to Time Series Forecasting

A Neural Network Approach to Time Series Forecasting A Neural Nework Approach o Tme Seres Forecasng Iffa A. Gheyas, Lesle S. Smh Absrac We propose a smple approach for forecasng unvarae me seres. The proposed algorhm s an ensemble learnng echnque ha combnes

More information

MACROECONOMIC CONDITIONS AND INCOME DISTRIBUTION IN VENEZUELA:

MACROECONOMIC CONDITIONS AND INCOME DISTRIBUTION IN VENEZUELA: MACROECONOMIC CONDITIONS AND INCOME DISTRIBUTION IN VENEZUELA: 197-199 Raul J. Crespo* January, 2004 *Conac: Economcs Deparmen, Unversy of Brsol, 8 Woodland Road, Brsol, BS8 1TN, Uned Kngdom. Tel.: + 44

More information

Recall from last time. The Plan for Today. INTEREST RATES JUNE 22 nd, J u n e 2 2, Different Types of Credit Instruments

Recall from last time. The Plan for Today. INTEREST RATES JUNE 22 nd, J u n e 2 2, Different Types of Credit Instruments Reall from las me INTEREST RATES JUNE 22 nd, 2009 Lauren Heller Eon 423, Fnanal Markes Smple Loan rnpal and an neres paymen s pad a maury Fxed-aymen Loan Equal monhly paymens for a fxed number of years

More information

Stock Market Behaviour Around Profit Warning Announcements

Stock Market Behaviour Around Profit Warning Announcements Sock Marke Behaviour Around Profi Warning Announcemens Henryk Gurgul Conen 1. Moivaion 2. Review of exising evidence 3. Main conjecures 4. Daa and preliminary resuls 5. GARCH relaed mehodology 6. Empirical

More information

Online appendices from The xva Challenge by Jon Gregory. APPENDIX 14A: Deriving the standard CVA formula.

Online appendices from The xva Challenge by Jon Gregory. APPENDIX 14A: Deriving the standard CVA formula. Onlne appendces fro he xa Challenge by Jon Gregory APPNDX 4A: Dervng he sandard CA forla We wsh o fnd an expresson for he rsky vale of a need se of dervaves posons wh a ax ary dae Denoe he rsk-free vale

More information