Big Data & Machine Learning in HEP

Size: px
Start display at page:

Download "Big Data & Machine Learning in HEP"

Transcription

1 Big Data & Machine Learning in HEP Mike Williams Department of Physics & Laboratory for Nuclear Science Massachusetts Institute of Technology March 17, 216

2 The Large Hadron Collider Outline { Big Data Real-Time Analysis ML Propaganda

3 Big Data & Real-Time Processing

4 Big Data LHC detectors are BIG in both size and data output. E.g., ATLAS weighs over 7 tonnes and has million channels. Figure 1: Schematic diagram of the ATLAS detector at the LHC, which weighs over 7 tons The beam-crossing rate at the LHC is sensor 4 MHz, in each and has about million electronic channels in its arrayswhich that areresults used to record data detector from multiple proton-proton collisions that occur at a rate of 4 MHz. per year! producing ~petabyte / second of data or zettabytes

5 Big Data Figure 1: Schematic diagram of the ATLAS detector at the LHC, which weighs over 7 ton Custom-built electronics used to read out the sensor arrays all perform zero and has about million electronic channels in its sensor arrays that are used to record data from suppression to drastically reduce the data rates. multiple proton-proton collisions that occur at a rate of 4 MHz post zero-suppression data rates per experiment (ATLAS & CMS) data [EB] data rate [TB/s] 18 1 year By the HL-LHC era, each experiment will need to process in real time roughly 3 Figure 2: Data volumes that must be processed by the data-ingestion system of the CMS experi 15 TB/s or 6 EB/year (comparable to largest-scale industrial data sets). ment: (left axis, red circles) peak data rate in terabytes per second and (right axis, blue solid line 5

6 Real-Time Processing Each experiment currently implements both hardware and software triggering stages (cascading approach). I ll focus on LHCb (most advanced trigger). ~TB/s (post zero suppression) ~5 GB/s Feature-building in custom electronics (e.g. FPGAs) used to greatly reduce rate, but online computing systems still have herculean tasks. 6

7 Real-Time Processing 5 GB/s Real-time reconstruction for all tracks with pt >.5 GeV. 27, physical CPU cores (>5k logical cores) are used to run the full offline reconstruction in real time. 8 GB/s Data buffered on disk while alignment/calibration done. 8 GB/s Real-time reconstruction for all tracks, final PID, etc., all available to select events. PB of disk used for the buffering while final alignment/calibration done. Final event selection done with access to best-quality data (mostly done during down time between fills)..7 GB/s 7

8 Real-Time Processing In Run 3, LHCb will increase the post-zero-suppression data rate to 5 TB/s, but also remove the hardware trigger. The full reconstruction will be done in real time on all events. 5 TB/s (post zero suppression) ATLAS/CMS will go to 15 TB/s in Run 4, but for hermetic detectors it s not feasible to go straight to software (wifi data transfer being investigated). 8

9 Offline Processing The bulk of offline computing is done using the Worldwide LHC Computing Grid (WLCG) which has 17 computing centers around the world and runs over 2M jobs / day. Currently, the LHC experiments persist about 3 PB / year of data -- but this still isn t enough. Many strategies being explored (scouting, partial-event storage, data compression, etc) to increase the physics output. 9

10 Machine Learning in HEP

11 ML in LHCb Trigger 5 GB/s Real-time reconstruction for all tracks with pt >.5 GeV. Two ML algorithms used in this stage that select about 7% of the output BW. 8 GB/s Data buffered on disk while alignment/calibration done. 8 GB/s Final event selection is currently roughly 3% ML based (first introduced into the main trigger algorithm for the start of 211 data taking). LHCb trigger using a custom-built ML algorithm. Real-time reconstruction for all tracks, final PID, etc., all available to select events..7 GB/s V.Gligorov, MW, JINST 8 (212) P

12 Ghost Killers & PID Ghost-killing neural networks do an excellent job of removing fake tracks (which is a big problem with the occupancies of the LHC). ) 2 candidates / (4 MeV/c LHCb preliminary all D Kπ fakes Fraction of candidates per bin LHCb Simulated τ µ µ + µ Calibrated τ µ µ + µ Data sidebands M PID response m Kπ 2 [MeV/c ] Particle ID is now commonly done using neural networks as well. These combine the info from all subsystems and exploit t h e m a n y ( o f t e n o b s c u r e ) correlations between features to ID particles. 12

13 Higgs to CMS When the LHC started, people were still worried about using ML. CMS actually published both cut-based and BDT-based Higgs exclusions for the WW decay mode -- so we can see directly what is gained: 95% C.L. Limit on / SM 2 1 CMS Preliminary s=7 TeV, L = 4.9 fb -1 s=8 TeV, L = 19.5 fb H WW 2l2 /1-jet cut based -1 Observed Median Expected Expected ± 1 Expected ± 2 Injection m =125 GeV ± 1 H m H [GeV] 13

14 Higgs to CMS When the LHC started, people were still worried about using ML. CMS actually published both cut-based and BDT-based Higgs exclusions for the WW decay mode -- so we can see directly what is gained: 95% C.L. Limit on / SM 2 1 CMS Preliminary -1 s=7 TeV, L = 4.9 fb s=8 TeV, L = 19.5 fb H WW 2l2 /1-jet -1 Observed Median Expected Expected ± 1 Expected ± 2 Injection m =125 GeV ± 1 H m H [GeV] 14

15 Using BDTs There are 3 standard uses for classification: (1) place a cut on the BDT response; (2) fit the BDT response; or (3) analyze the data in bins of BDT response. For example, the recent LHCb pentaquark discovery analysis trained a BDT to identify the decay Λb J/ψpK then placed a cut on the BDT response for each candidate which gives the mass plot below. The BDT information was then not used further in the analysis (it just selects events to study). Events / ( 4 MeV) LHCb 26k Λb 5% BKGD in signal region LHCb-PAPER [MeV] m J/ψ K p 15

16 Entries /.25 Tag jets using a SV then feed properties to 2 BDTs. Fit the 2-D BDT response distribution in data to extract the b-jet, c-jet and lightparton-jet yields. Each BDT uses features as input (ML as dimensional-reduction). Data MC(B) Data MC(S+B) 7 Data χ 2 /dof -3.5= CMS -1 s = 7TeV, L = 5. fb -1 s = 8TeV, L = 18.9 fb pp VH; H bb VH Fit BDT Response Background VH(bb) 125 GeV CMS, PRD 89 (214) 123 Background uncert. -4 χ 2 /dof -3.5 = log(s/b) BDT(b c) LHCb data 19 light parton JINST (215) P613 LHCb-PAPER beauty charm Initial (no-tagging) sample: D+jet 7% light parton, 22% charm, 8% beauty BDT(bc udsg) Figure 6: From a b-jet and c-je Most Higgs analyses now also work this way; i.e., they involve fitting BDT distributions. Essentially, one can reduce a large-n-d fitting problem into a 1-D fitting problem (while still using the N-D info). (left: VH(bb), 18 features input). responses observed in data (ann are found); (middle) projection BDT templates shown here wer were used to calibrate the BDT 16

17 Bs μμ Calibrate BDT to have uniform response on signal, bin data in BDT response and analyze simultaneously. ] 3 Candidates / (4 MeV/c 2 ) [ Candidates / (4 MeV/c 2 ) LHCb, BDT [.,.25) Data Signal and background Bs + B + Combinatorial background Semi-leptonic background Peaking background m [GeV/c 2 + ] LHCb, BDT [.6,.7) m [GeV/c 2 + ] Candidates / (4 MeV/c 2 ) Candidates / (4 MeV/c 2 ) LHCb, BDT [.25,.4) m [GeV/c 2 + ] LHCb, BDT [.7,.8) m [GeV/c 2 + ] Candidates / (4 MeV/c 2 ) Candidates / (4 MeV/c 2 ) LHCb, BDT [.4,.5) m [GeV/c 2 + ] LHCb, BDT [.8,.9) m [GeV/c 2 + ] Candidates / (4 MeV/c 2 ) Candidates / (4 MeV/c 2 ) andidates / (4 MeV/c 2 ) LHCb, BDT [.5,.6) m [GeV/c 2 + ] LHCb, BDT [.9,1.] m [GeV/c 2 + ] CMS, 7TeV, FR, BDT [.26,1.] Combining CMS&LHCb data gives > 5σ and a BF ~ 3x -9 7(consistent with the SM expected value). CMS&LHCb, Nature 522, 68 (215). 17

18 Regression ML is also now being used for regression. For example, CMS has moved to ML-based jet-energy corrections (e.g., for Higgs to bb). ) -1 (GeV 1/N dn/dm bb CMS Simulation m H = 125 GeV (set A) Regressed Raw PEAK = GeV FWHM = 27. GeV PEAK = GeV FWHM = 32.8 GeV CMS, PRD 92 (215) m bb (GeV) 18

19 Advanced Topics ML algorithms where the response is de-correlated from some set of userdefined features. See Stevens and Williams [ ]; Rogozhnikova, Bukva, Gligorov, Ustyuzhanin, Williams [14.414]. Used to search for a new boson in PRL 115 (215) 16182, and currently being used in many papers to come out soon. Deep learning is actively being explored as a way to remove the human-led feature-design stage. See Baldi, Sadowski, Whiteson [ ]. Parametrized classifiers to interpolate the optimal selection when only a discrete subset of possible signals has been simulated. See Baldi, Cranmer, Faucett, Sadowski, Whiteson [ ]. Hyper-parameter tuning using Bayesian global optimization. Blending classifiers, etc, etc... 19

20 Tools Most physicists are using either ROOT s TMVA package or the python scikitlearn (sklearn for short). Some limited usage of proprietary packages too. See also pypi.python.org/pypi/hep_ml/.2. for some custom HEP algorithms. 2

21 Propaganda

22 Collaboration The few of us (physicists) who do research in this area are convinced that direct collaboration/engagement with computer/data scientists is critical. We d like to open the box to make it easier for those communities to do research on our (awesome) problems (please contact me if interested). 22

23 Title xxx 23

24 Advertisement ML school for Ph.D. students in Lund (follows one last year in Moscow). Go to bit.ly/mlhep216, or me, or 24

25 Summary HEP experiments are VERY big data. The online systems must process some of the world s largest data sets. Amazingly, we re now seeing real-time reconstruction, calibration, and alignment -- and soon we ll see a triggerless-readout system at the LHC! Not long ago, using a ML algorithm was considered unsafe ; however, now it s rare to see a HEP data analysis that does not use ML algorithms. ML is used to kill fake tracks, do particle ID, classify events, regression, dimensional reduction, etc. ML algorithms are even starting to dominate real-time (trigger) selections. There are many great (free) tools out there that are pretty easy to use. It s never been easier to get started. For students, there are dedicated summer schools to help. For everyone, there are now dedicated workshops. 25

Study of the rare decays of B 0 and B 0 to muon pairs with the ATLAS detector at LHC Run- 1

Study of the rare decays of B 0 and B 0 to muon pairs with the ATLAS detector at LHC Run- 1 Study of the rare decays of B 0 and B 0 to muon pairs s with the ATLAS detector at LHC Run- 1 Sandro Pales@ni (CERN) On behalf of the ATLAS Collabora@on Rencontres de Moriond EW interac@on and Unified

More information

The Higgs Particle Mass, Width and Couplings Seminar Particle Physics at the LHC

The Higgs Particle Mass, Width and Couplings Seminar Particle Physics at the LHC The Higgs Particle, and Seminar Particle Physics at the LHC Freiburg, 22.07.2014 Albert-Ludwigs-Universität Freiburg Michael Schubert Contents 2/54 introduction We found a Higgs boson! So... What now?

More information

Electron Identification Based on Boosted Decision Trees

Electron Identification Based on Boosted Decision Trees Electron Identification Based on Boosted Decision Trees Hai-Jun Yang University of Michigan, Ann Arbor (with T. Dai, X. Li, A. Wilson, B. Zhou) ATLAS Egamma Meeting October 2, 2008 Motivation Lepton (e,

More information

B-tagging based on Boosted Decision Trees

B-tagging based on Boosted Decision Trees B-tagging based on Boosted Decision Trees Haijun Yang University of Michigan (with Xuefei Li and Bing Zhou) ATLAS B-tagging Meeting CERN, July 7, 2009 1 Introduction Outline Boosted Decision Trees B-tagging

More information

LHC DATA! Current status of LHC and ATLAS and prospects for

LHC DATA! Current status of LHC and ATLAS and prospects for LHC DATA! Current status of LHC and ATLAS and prospects for 2010-2011 Andy Haas SLAC West Coast LHC Theory Meeting SCIPP, UC Santa Cruz May 21, -2010 Andy Haas 5/21/2010 Slide 1 Overview LHC commissioning

More information

Search for pairs of Higgs bosons in the bb τ + τ decay channel with the ATLAS detector

Search for pairs of Higgs bosons in the bb τ + τ decay channel with the ATLAS detector Search for pairs of Higgs bosons in the bb τ + τ decay channel with the ATLAS detector Petar Bokan Uppsala University, University of Göttingen supervisors: Arnaud Ferrari, Stan Lai Half-time PhD seminar

More information

Towards the LHC Start

Towards the LHC Start Science @ KIP Heidelberg, June 5, 2008 Towards the LHC Start What, why, how and a bit of daily life. Martin Wessels KIP, University of Heidelberg LH C est pas sorcier The Guardian "Particle physics is

More information

SANE Analysis Update

SANE Analysis Update SANE Analysis Update Artificial Neural Networks in Analyzing BETA Whitney Armstrong Temple University Physics Department January 23, 2010 Introduction Introduction 1 Spin Asymmetries of the Nucleon Experiment

More information

Extracting Higgs Couplings from LHC and ILC Data

Extracting Higgs Couplings from LHC and ILC Data Extracting Higgs Couplings from LHC and ILC Data Sven Heinemeyer (CSIC, Santander) Madrid, 09/2014 1. Introduction 2. Our tool: HiggsSignals 3. Higgs bosons couplings from LHC data 4. Higgs boson couplings

More information

DAMA Slides/Graphics

DAMA Slides/Graphics DAMA Slides/Graphics 000608 Rick Gaitskell Center for Particle Astrophysics UC Berkeley source at http://cdms.berkeley.edu/gaitskell/ Gaitskell Best fit to Ann Mod data alone Best Fit Minimum DAMA NaI/1-4

More information

A CBA MODEL FOR RESEARCH INFRASTRUCTURES The case of the Large Hadron Collider LHC at CERN

A CBA MODEL FOR RESEARCH INFRASTRUCTURES The case of the Large Hadron Collider LHC at CERN OECD GLOBAL SCIENCE FORUM WORKSHOP ON METHODOLOGIES AND TOOLS FOR ASSESSING SOCIO-ECONOMIC IMPACT OF RESEARCH INFRASTRUCTURES 3 November 215 - French Ministry for Research, Paris A CBA MODEL FOR RESEARCH

More information

The latest CALICE activities

The latest CALICE activities The latest CALICE activities Fabrizio Salvatore Royal Holloway University of London HEP seminar, RHUL, 22 nd January 2008 Rationale for a Linear Collider The LHC start up next year is expected to mark

More information

Presentation to PPAP meeting. Jordan Nash PPAN Chair

Presentation to PPAP meeting. Jordan Nash PPAN Chair Presentation to PPAP meeting Jordan Nash PPAN Chair Why this meeting? What is happening What do we need from the PP Community What is happening STFC needs to prepare a budget for 2010/2011 and planning

More information

Department Heads Meeting March 10, David B. MacFarlane

Department Heads Meeting March 10, David B. MacFarlane Department Heads Meeting March 10, 2011 David B. MacFarlane Topics News LDRD proposals: PPA internal deadline: March 18 with 1-pager Lab-wide: April 15 with full proposal Field Budget Request for FY2012

More information

Current Status of e-id based on BDT Algorithm

Current Status of e-id based on BDT Algorithm Current Status of e-id based on BDT Algorithm Hai-Jun Yang University of Michigan (with X. Li, T. Dai, A. Wilson, B. Zhou) BNL Analysis Jamboree March 19, 2009 Goals ATLAS default electron-id (IsEM) has

More information

An introduction to Machine learning methods and forecasting of time series in financial markets

An introduction to Machine learning methods and forecasting of time series in financial markets An introduction to Machine learning methods and forecasting of time series in financial markets Mark Wong markwong@kth.se December 10, 2016 Abstract The goal of this paper is to give the reader an introduction

More information

Project Management Professional (PMP) Exam Prep Course 06 - Project Time Management

Project Management Professional (PMP) Exam Prep Course 06 - Project Time Management Project Management Professional (PMP) Exam Prep Course 06 - Project Time Management Slide 1 Looking Glass Development, LLC (303) 663-5402 / (888) 338-7447 4610 S. Ulster St. #150 Denver, CO 80237 information@lookingglassdev.com

More information

Instruction (Manual) Document

Instruction (Manual) Document Instruction (Manual) Document This part should be filled by author before your submission. 1. Information about Author Your Surname Your First Name Your Country Your Email Address Your ID on our website

More information

Probing Higgs self-couplings at Future Colliders

Probing Higgs self-couplings at Future Colliders Probing iggs self-couplings at Future Colliders Ambresh Shivaji IISER Mohali, Punjab, INDIA Mini-Workshop: Theory Physics Opportunities and Advanced Tools IAS (KUST), January -, 29 Ambresh Shivaji (IISER

More information

Modern Methods of Data Analysis - SS 2009

Modern Methods of Data Analysis - SS 2009 Modern Methods of Data Analysis Lecture II (7.04.09) Contents: Characterize data samples Characterize distributions Correlations, covariance Reminder: Average of a Sample arithmetic mean of data set: weighted

More information

A proposal for a CERN-Japan collaboration programme as a reform of the Asia-CERN Fellowship Programme. July 13, 2004 (Version-3 July 30, 2004)

A proposal for a CERN-Japan collaboration programme as a reform of the Asia-CERN Fellowship Programme. July 13, 2004 (Version-3 July 30, 2004) A proposal for a CERN-Japan collaboration programme as a reform of the Asia-CERN Fellowship Programme July 13, 2004 (Version-3 July 30, 2004) Takahiko Kondo (KEK and Japanese ATLAS group member) Summary

More information

Comparability in Meaning Cross-Cultural Comparisons Andrey Pavlov

Comparability in Meaning Cross-Cultural Comparisons Andrey Pavlov Introduction Comparability in Meaning Cross-Cultural Comparisons Andrey Pavlov The measurement of abstract concepts, such as personal efficacy and privacy, in a cross-cultural context poses problems of

More information

Model Maestro. Scorto TM. Specialized Tools for Credit Scoring Models Development. Credit Portfolio Analysis. Scoring Models Development

Model Maestro. Scorto TM. Specialized Tools for Credit Scoring Models Development. Credit Portfolio Analysis. Scoring Models Development Credit Portfolio Analysis Scoring Models Development Scorto TM Models Analysis and Maintenance Model Maestro Specialized Tools for Credit Scoring Models Development 2 Purpose and Tasks to Be Solved Scorto

More information

Application of Big Data Analytics via Soft Computing. Yunus Yetis

Application of Big Data Analytics via Soft Computing. Yunus Yetis Application of Big Data Analytics via Soft Computing Yunus Yetis INTRODUCTION Ø System of Systems (SoS) and cyberphysic are integrated, independently operating systems working in a cooperative mode to

More information

LAST SECTION!!! 1 / 36

LAST SECTION!!! 1 / 36 LAST SECTION!!! 1 / 36 Some Topics Probability Plotting Normal Distributions Lognormal Distributions Statistics and Parameters Approaches to Censor Data Deletion (BAD!) Substitution (BAD!) Parametric Methods

More information

Business Statistics 41000: Probability 3

Business Statistics 41000: Probability 3 Business Statistics 41000: Probability 3 Drew D. Creal University of Chicago, Booth School of Business February 7 and 8, 2014 1 Class information Drew D. Creal Email: dcreal@chicagobooth.edu Office: 404

More information

SLAC PPA Budget Discussion: FY09 Allocations Among BNR Codes

SLAC PPA Budget Discussion: FY09 Allocations Among BNR Codes SLAC PPA Budget Discussion: FY09 Allocations Among BNR Codes S. Kahn, D. MacFarlane, R. Blandford, T. Raubenheimer, C. Chang Particle Physics and Astrophysics Directorate Page 1 Purpose of This Meeting

More information

From LEP to the Large Hadron CERN Spring School, LNF, Frascati, 15 May 2006

From LEP to the Large Hadron CERN Spring School, LNF, Frascati, 15 May 2006 From LEP to the Large Hadron Collider @ CERN Spring School, LNF, Frascati, 15 May 2006 Luciano Maiani, Universita di Roma La Sapienza (a real photo!!!) A revolutionary tool: Collision Rings beam-beam collisions

More information

Strike Point Control on EAST Using an Isoflux Control Method

Strike Point Control on EAST Using an Isoflux Control Method Plasma Science and Technology, Vol.17, No.9, Sep. 2015 Strike Point Control on EAST Using an Isoflux Control Method XING Zhe ( ) 1, XIAO Bingjia ( ) 1,2, LUO Zhengping ( ) 1, M. L. WALKER 3, D. A. HUMPHREYS

More information

Electron Lenses. for the LHC

Electron Lenses. for the LHC Electron Lenses for the LHC Vladimir Shiltsev With contributions from Yu.Alexahin, J.Johnstone, V.Kamerdzhiev, U.Dorda LARP B-B-Compensation Workshop, SLAC, July 2-4, 2007 Content On possible uses of ELs

More information

Bond Pricing AI. Liquidity Risk Management Analytics.

Bond Pricing AI. Liquidity Risk Management Analytics. Bond Pricing AI Liquidity Risk Management Analytics www.overbond.com Fixed Income Artificial Intelligence The financial services market is embracing digital processes and artificial intelligence applications

More information

Supersedes: 9/01/11 (Rev.5) Preparer: Owner: Approver: Team Member, North America Process Safety Center of Expertise

Supersedes: 9/01/11 (Rev.5) Preparer: Owner: Approver: Team Member, North America Process Safety Center of Expertise Procedure No.: BC032.019 Page: 1 of 12 Preparer: Owner: Approver: Team Member, North America Process Safety Center of Expertise Manager, North America Process Safety Center of Expertise Sr. Vice President,

More information

BlitzTrader. Next Generation Algorithmic Trading Platform

BlitzTrader. Next Generation Algorithmic Trading Platform BlitzTrader Next Generation Algorithmic Trading Platform Introduction TRANSFORM YOUR TRADING IDEAS INTO ACTION... FAST TIME TO THE MARKET BlitzTrader is next generation, most powerful, open and flexible

More information

Copyright 2011 Pearson Education, Inc. Publishing as Addison-Wesley.

Copyright 2011 Pearson Education, Inc. Publishing as Addison-Wesley. Appendix: Statistics in Action Part I Financial Time Series 1. These data show the effects of stock splits. If you investigate further, you ll find that most of these splits (such as in May 1970) are 3-for-1

More information

Deep Learning - Financial Time Series application

Deep Learning - Financial Time Series application Chen Huang Deep Learning - Financial Time Series application Use Deep learning to learn an existing strategy Warning Don t Try this at home! Investment involves risk. Make sure you understand the risk

More information

TIE2140 / IE2140e Engineering Economy Tutorial 6 (Lab 2) Engineering-Economic Decision Making Process using EXCEL

TIE2140 / IE2140e Engineering Economy Tutorial 6 (Lab 2) Engineering-Economic Decision Making Process using EXCEL TIE2140 / IE2140e Engineering Economy Tutorial 6 (Lab 2) Engineering-Economic Decision Making Process using EXCEL Solutions Guide by Wang Xin, Hong Lanqing & Mei Wenjie 1. Learning Objectives In this lab-based

More information

To apply SP models we need to generate scenarios which represent the uncertainty IN A SENSIBLE WAY, taking into account

To apply SP models we need to generate scenarios which represent the uncertainty IN A SENSIBLE WAY, taking into account Scenario Generation To apply SP models we need to generate scenarios which represent the uncertainty IN A SENSIBLE WAY, taking into account the goal of the model and its structure, the available information,

More information

Credit Card Default Predictive Modeling

Credit Card Default Predictive Modeling Credit Card Default Predictive Modeling Background: Predicting credit card payment default is critical for the successful business model of a credit card company. An accurate predictive model can help

More information

The Cost of Poor Reliability for Future Particle Accelerator Driven Systems in the UK Energy Market

The Cost of Poor Reliability for Future Particle Accelerator Driven Systems in the UK Energy Market The Cost of Poor Reliability for Future Particle Accelerator Driven Systems in the UK Energy Market Steven J Steer, William J Nuttall, Geoff T Parks & Leonardo V N Gonçalves Accelerator Driven Nuclear

More information

What is HomeBlockCoin

What is HomeBlockCoin Whitepaper This is not the final version of the HomeBlockCoin whitepaper. There won t be any significant changes but we will add more information as soon as possible. About HomeBlockCoin HomeBlockCoin

More information

Statistical and Machine Learning Approach in Forex Prediction Based on Empirical Data

Statistical and Machine Learning Approach in Forex Prediction Based on Empirical Data Statistical and Machine Learning Approach in Forex Prediction Based on Empirical Data Sitti Wetenriajeng Sidehabi Department of Electrical Engineering Politeknik ATI Makassar Makassar, Indonesia tenri616@gmail.com

More information

Summary of Expenditure for CMS Maintenance & Operations for the Year 2015

Summary of Expenditure for CMS Maintenance & Operations for the Year 2015 Summary of Expenditure for CMS Maintenance & Operations for the Year 2015 INTRODUCTION This document summarizes the expenditure that the CMS Collaboration has made in 2015 in order to maintain and operate

More information

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18, ISSN

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18,   ISSN International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18, www.ijcea.com ISSN 31-3469 AN INVESTIGATION OF FINANCIAL TIME SERIES PREDICTION USING BACK PROPAGATION NEURAL

More information

6.231 DYNAMIC PROGRAMMING LECTURE 8 LECTURE OUTLINE

6.231 DYNAMIC PROGRAMMING LECTURE 8 LECTURE OUTLINE 6.231 DYNAMIC PROGRAMMING LECTURE 8 LECTURE OUTLINE Suboptimal control Cost approximation methods: Classification Certainty equivalent control: An example Limited lookahead policies Performance bounds

More information

Predictive Analytics STUART KLUGMAN. Senior Staff Fellow. June 7, 2018

Predictive Analytics STUART KLUGMAN. Senior Staff Fellow. June 7, 2018 Predictive Analytics STUART KLUGMAN Senior Staff Fellow June 7, 2018 Predictive Analytics What is it? Why does the SOA care about it? What is the SOA doing about it? Pre-qualification education Continuing

More information

Curve fitting for calculating SCR under Solvency II

Curve fitting for calculating SCR under Solvency II Curve fitting for calculating SCR under Solvency II Practical insights and best practices from leading European Insurers Leading up to the go live date for Solvency II, insurers in Europe are in search

More information

Model Maestro. Scorto. Specialized Tools for Credit Scoring Models Development. Credit Portfolio Analysis. Scoring Models Development

Model Maestro. Scorto. Specialized Tools for Credit Scoring Models Development. Credit Portfolio Analysis. Scoring Models Development Credit Portfolio Analysis Scoring Models Development Scorto TM Models Analysis and Maintenance Model Maestro Specialized Tools for Credit Scoring Models Development 2 Purpose and Tasks to Be Solved Scorto

More information

Lecture 9: Classification and Regression Trees

Lecture 9: Classification and Regression Trees Lecture 9: Classification and Regression Trees Advanced Applied Multivariate Analysis STAT 2221, Spring 2015 Sungkyu Jung Department of Statistics, University of Pittsburgh Xingye Qiao Department of Mathematical

More information

Liquidity Regulation and Credit Booms: Theory and Evidence from China. JRCPPF Sixth Annual Conference February 16-17, 2017

Liquidity Regulation and Credit Booms: Theory and Evidence from China. JRCPPF Sixth Annual Conference February 16-17, 2017 Liquidity Regulation and Credit Booms: Theory and Evidence from China Kinda Hachem Chicago Booth and NBER Zheng Michael Song Chinese University of Hong Kong JRCPPF Sixth Annual Conference February 16-17,

More information

Office of Science. Experiences from the Large Hadron Collider. U.S. Department of Energy s. Burning Plasma Physics Advisory Committee (BPPAC) Meeting

Office of Science. Experiences from the Large Hadron Collider. U.S. Department of Energy s. Burning Plasma Physics Advisory Committee (BPPAC) Meeting s Experiences from the Large Hadron Collider Burning Plasma Physics Advisory Committee (BPPAC) Meeting Jim Yeck April 29, 2003 OUTLINE Overview of the LHC CERN Experience U.S. LHC Construction Project

More information

Implied Volatility using Python s Pandas Library

Implied Volatility using Python s Pandas Library Implied Volatility using Python s Pandas Library Brian Spector New York Quantitative Python Users Group March 6 th 2014 Experts in numerical algorithms and HPC services Introduction Motivation Python Pandas

More information

Web Science & Technologies University of Koblenz Landau, Germany. Lecture Data Science. Statistics and Probabilities JProf. Dr.

Web Science & Technologies University of Koblenz Landau, Germany. Lecture Data Science. Statistics and Probabilities JProf. Dr. Web Science & Technologies University of Koblenz Landau, Germany Lecture Data Science Statistics and Probabilities JProf. Dr. Claudia Wagner Data Science Open Position @GESIS Student Assistant Job in Data

More information

Income Tax Planner. The most trusted, widely used tax planning software.

Income Tax Planner. The most trusted, widely used tax planning software. Income Tax Planner The most trusted, widely used tax planning software. Makes Tax Planning Effective, Accurate, Efficient. Analyzing the full, potential tax implications of life and financial changes such

More information

CIF Stock Recommendation Report (Fall 2012)

CIF Stock Recommendation Report (Fall 2012) Date: 10/13/2012 Analyst Name: Ryan Ellingsen CIF Stock Recommendation Report (Fall 2012) Company Name and Ticker: Colgate-Palmolive Company (CL) Section (A) Summary Recommendation Buy: Hold Target Price:

More information

Look Elsewhere Effect

Look Elsewhere Effect Eilam Gross and Ofer Vitells Weizmann Institute of Science 1 Eilam Gross & Ofer Vitells, Banff A Plausible Thumb Rule for a Trial # Eilam Gross and Ofer Vitells Weizmann Institute of Science 2 Eilam Gross

More information

Predicting Foreign Exchange Arbitrage

Predicting Foreign Exchange Arbitrage Predicting Foreign Exchange Arbitrage Stefan Huber & Amy Wang 1 Introduction and Related Work The Covered Interest Parity condition ( CIP ) should dictate prices on the trillion-dollar foreign exchange

More information

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program August 2017

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program August 2017 Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program August 2017 The time limit for this exam is four hours. The exam has four sections. Each section includes two questions.

More information

CHEP An update on the scalability limits of the Condor batch system. by Igor Sfiligoi for the UWisc and Red Hat Condor teams doing the real work

CHEP An update on the scalability limits of the Condor batch system. by Igor Sfiligoi for the UWisc and Red Hat Condor teams doing the real work CHEP 2010 An update on the scalability limits of the Condor batch system by Igor Sfiligoi for the UWisc and Red Hat Condor teams doing the real work CHEP 2010 Update on Condor scalability 1 What is Condor

More information

Minimizing Basis Risk for Cat-In- Catastrophe Bonds Editor s note: AIR Worldwide has long dominanted the market for. By Dr.

Minimizing Basis Risk for Cat-In- Catastrophe Bonds Editor s note: AIR Worldwide has long dominanted the market for. By Dr. Minimizing Basis Risk for Cat-In- A-Box Parametric Earthquake Catastrophe Bonds Editor s note: AIR Worldwide has long dominanted the market for 06.2010 AIRCurrents catastrophe risk modeling and analytical

More information

The Dynamic Cross-sectional Microsimulation Model MOSART

The Dynamic Cross-sectional Microsimulation Model MOSART Third General Conference of the International Microsimulation Association Stockholm, June 8-10, 2011 The Dynamic Cross-sectional Microsimulation Model MOSART Dennis Fredriksen, Pål Knudsen and Nils Martin

More information

A Machine Learning Approach to Price Impact Modeling Using NASDAQ Level-2 ITCH Data

A Machine Learning Approach to Price Impact Modeling Using NASDAQ Level-2 ITCH Data A Machine Learning Approach to Price Impact Modeling Using NASDAQ Level-2 ITCH Data Honors Thesis Defense : Computer Science Student: Jacob Brewer Supervisor: Dr. Sean Warnick Brigham Young University

More information

Based on the audacious premise that a lot more can be done with a lot less.

Based on the audacious premise that a lot more can be done with a lot less. A lot less of IT involvement, minimal processes, greater attention to high-value tasks, enhanced decision-making all resulting in better underwriting. Based on the audacious premise that a lot more can

More information

November 29, 2013 Fish4Knowledge Final Review 1/31. Welcome. de Recherche en Informatique

November 29, 2013 Fish4Knowledge Final Review 1/31. Welcome. de Recherche en Informatique November 29, 2013 Fish4Knowledge Final Review 1/31 Welcome Jenny Benois-Pineau Rafael Garcia Stefano Bertolo LABRI - Laboratoire Bordelais de Recherche en Informatique University of Girona European Commission

More information

Machine Learning Applications in Insurance

Machine Learning Applications in Insurance General Public Release Machine Learning Applications in Insurance Nitin Nayak, Ph.D. Digital & Smart Analytics Swiss Re General Public Release Machine learning is.. Giving computers the ability to learn

More information

WESTERNPIPS TRADER 3.9

WESTERNPIPS TRADER 3.9 WESTERNPIPS TRADER 3.9 FIX API HFT Arbitrage Trading Software 2007-2017 - 1 - WESTERNPIPS TRADER 3.9 SOFTWARE ABOUT WESTERNPIPS TRADER 3.9 SOFTWARE THE DAY HAS COME, WHICH YOU ALL WERE WAITING FOR! PERIODICALLY

More information

Bill Boroski LQCD Contractor Project Manager Robert D. Kennedy Associate Contractor Project Manager

Bill Boroski LQCD Contractor Project Manager Robert D. Kennedy Associate Contractor Project Manager Bill Boroski LQCD Contractor Project Manager Robert D. Kennedy Associate Contractor Project Manager DOE FY2015 Annual Progress Review Brookhaven National Laboratory May 21-22, 2015 Project scope, organization,

More information

Update on USDA s Transition to New Official Moisture Technology

Update on USDA s Transition to New Official Moisture Technology Update on USDA s Transition to New Official Moisture Technology Randall Jones, Deputy Administrator, GIPSA David Funk, Chief Scientist, GIPSA United States Department of Agriculture Grain Inspection, Packers

More information

Complexity is a challenge in the insurance industry. Products,

Complexity is a challenge in the insurance industry. Products, By Van Beach Complexity is a challenge in the insurance industry. Products, regulations, and the underlying risks of insurance are difficult to quantify, manage, and explain. Actuarial modeling has felt

More information

A Novel Method of Trend Lines Generation Using Hough Transform Method

A Novel Method of Trend Lines Generation Using Hough Transform Method International Journal of Computing Academic Research (IJCAR) ISSN 2305-9184, Volume 6, Number 4 (August 2017), pp.125-135 MEACSE Publications http://www.meacse.org/ijcar A Novel Method of Trend Lines Generation

More information

Monitoring - revisited

Monitoring - revisited Monitoring - revisited Anders Ringgaard Kristensen Slide 1 Outline Filtering techniques applied to monitoring of daily gain in slaughter pigs: Introduction Basic monitoring Shewart control charts DLM and

More information

Anurag Sodhi University of North Carolina at Charlotte

Anurag Sodhi University of North Carolina at Charlotte American Put Option pricing using Least squares Monte Carlo method under Bakshi, Cao and Chen Model Framework (1997) and comparison to alternative regression techniques in Monte Carlo Anurag Sodhi University

More information

THE PROBLEM THERE IS AN INFORMATION CRISIS IN CONSUMER FINANCE LATIKA. Emilian. Alternative online lender without enough data

THE PROBLEM THERE IS AN INFORMATION CRISIS IN CONSUMER FINANCE LATIKA. Emilian. Alternative online lender without enough data THE PROBLEM THERE IS AN INFORMATION CRISIS IN CONSUMER FINANCE NEEDS A LOAN WANTS TO LEND LATIKA Small business owner in India Emilian Alternative online lender without enough data INTRODUCTION WHAT IS

More information

The Assumption(s) of Normality

The Assumption(s) of Normality The Assumption(s) of Normality Copyright 2000, 2011, 2016, J. Toby Mordkoff This is very complicated, so I ll provide two versions. At a minimum, you should know the short one. It would be great if you

More information

CHEP An update on the scalability limits of the Condor batch system

CHEP An update on the scalability limits of the Condor batch system CHEP 2010 An update on the scalability limits of the Condor batch system by Igor Sfiligoi for the UWisc Condor and Red Hat MRG Grid teams doing the real work CHEP 2010 Update on Condor scalability 1 What

More information

Understanding neural networks

Understanding neural networks Machine Learning Neural Networks Understanding neural networks An Artificial Neural Network (ANN) models the relationship between a set of input signals and an output signal using a model derived from

More information

Are New Modeling Techniques Worth It?

Are New Modeling Techniques Worth It? Are New Modeling Techniques Worth It? Tom Zougas PhD PEng, Manager Data Science, TransUnion TORONTO SAS USER GROUP MAY 2, 2018 Are New Modeling Techniques Worth It? Presenter Tom Zougas PhD PEng, Manager

More information

Global population projections by the United Nations John Wilmoth, Population Association of America, San Diego, 30 April Revised 5 July 2015

Global population projections by the United Nations John Wilmoth, Population Association of America, San Diego, 30 April Revised 5 July 2015 Global population projections by the United Nations John Wilmoth, Population Association of America, San Diego, 30 April 2015 Revised 5 July 2015 [Slide 1] Let me begin by thanking Wolfgang Lutz for reaching

More information

Sensitivity analysis for risk-related decision-making

Sensitivity analysis for risk-related decision-making Sensitivity analysis for risk-related decision-making Eric Marsden What are the key drivers of my modelling results? Sensitivity analysis: intuition X is a sensitive

More information

Barrier Option. 2 of 33 3/13/2014

Barrier Option. 2 of 33 3/13/2014 FPGA-based Reconfigurable Computing for Pricing Multi-Asset Barrier Options RAHUL SRIDHARAN, GEORGE COOKE, KENNETH HILL, HERMAN LAM, ALAN GEORGE, SAAHPC '12, PROCEEDINGS OF THE 2012 SYMPOSIUM ON APPLICATION

More information

Overview of 2010 delivered physics Constructive criticism on 2010 LHC operation

Overview of 2010 delivered physics Constructive criticism on 2010 LHC operation LHC Operation as viewed from the experiments Thanks to many people who gave me comments, suggestions or material for this presentation Overview of 2010 delivered physics Constructive criticism on 2010

More information

Better decision making under uncertain conditions using Monte Carlo Simulation

Better decision making under uncertain conditions using Monte Carlo Simulation IBM Software Business Analytics IBM SPSS Statistics Better decision making under uncertain conditions using Monte Carlo Simulation Monte Carlo simulation and risk analysis techniques in IBM SPSS Statistics

More information

Usage of LCG-France Tier-2s 2010 summary

Usage of LCG-France Tier-2s 2010 summary README LCG France Usage of LCG-France Tier-2s 2010 summary EDMS id I-020370 https://edms.in2p3.fr/document/i 020370/ Créé le 03/03/2010 MaJ 03/02/2011 Auteurs F.Chollet Source : http://www3.egee.cesga.es/gridsite/accounting/cesga/tier2_view.html

More information

A COMPARATIVE STUDY OF DATA MINING TECHNIQUES IN PREDICTING CONSUMERS CREDIT CARD RISK IN BANKS

A COMPARATIVE STUDY OF DATA MINING TECHNIQUES IN PREDICTING CONSUMERS CREDIT CARD RISK IN BANKS A COMPARATIVE STUDY OF DATA MINING TECHNIQUES IN PREDICTING CONSUMERS CREDIT CARD RISK IN BANKS Ling Kock Sheng 1, Teh Ying Wah 2 1 Faculty of Computer Science and Information Technology, University of

More information

DATA HANDLING Five-Number Summary

DATA HANDLING Five-Number Summary DATA HANDLING Five-Number Summary The five-number summary consists of the minimum and maximum values, the median, and the upper and lower quartiles. The minimum and the maximum are the smallest and greatest

More information

Stock Market Predictor and Analyser using Sentimental Analysis and Machine Learning Algorithms

Stock Market Predictor and Analyser using Sentimental Analysis and Machine Learning Algorithms Volume 119 No. 12 2018, 15395-15405 ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu Stock Market Predictor and Analyser using Sentimental Analysis and Machine Learning Algorithms 1

More information

Announcing Pro Tools V2.0 March 28, 2015

Announcing Pro Tools V2.0 March 28, 2015 Announcing Pro Tools V2.0 March 28, 2015 A letter from the CEO.. Greetings OmniVest subscribers! Spring is in the air and it s the perfect time for me to update you on our magnificent OmniVest Professional

More information

Contact: Structural Policy Division, Mr. Danny Scorpecci. tel: ; fax: ; e- mail:

Contact: Structural Policy Division, Mr. Danny Scorpecci. tel: ; fax: ; e- mail: Unclassified C/WP6(2006)7 C/WP6(2006)7 Unclassified Organisation de Coopération et de Développement Economiques Organisation for Economic Co-operation and Development 25-Oct-2006 English - Or. English

More information

The Use of Artificial Neural Network for Forecasting of FTSE Bursa Malaysia KLCI Stock Price Index

The Use of Artificial Neural Network for Forecasting of FTSE Bursa Malaysia KLCI Stock Price Index The Use of Artificial Neural Network for Forecasting of FTSE Bursa Malaysia KLCI Stock Price Index Soleh Ardiansyah 1, Mazlina Abdul Majid 2, JasniMohamad Zain 2 Faculty of Computer System and Software

More information

6.825 Homework 3: Solutions

6.825 Homework 3: Solutions 6.825 Homework 3: Solutions 1 Easy EM You are given the network structure shown in Figure 1 and the data in the following table, with actual observed values for A, B, and C, and expected counts for D.

More information

Jacob: What data do we use? Do we compile paid loss triangles for a line of business?

Jacob: What data do we use? Do we compile paid loss triangles for a line of business? PROJECT TEMPLATES FOR REGRESSION ANALYSIS APPLIED TO LOSS RESERVING BACKGROUND ON PAID LOSS TRIANGLES (The attached PDF file has better formatting.) {The paid loss triangle helps you! distinguish between

More information

ALICE EMCal Project Execution Plan

ALICE EMCal Project Execution Plan ALICE EMCal Project Execution Plan at Lawrence Berkeley National Laboratory Berkeley, CA For the U.S. Department of Energy Office of Science Office of Nuclear Physics MIE-71-RC Document EMCal.4.v4 January

More information

STA2601. Tutorial letter 105/2/2018. Applied Statistics II. Semester 2. Department of Statistics STA2601/105/2/2018 TRIAL EXAMINATION PAPER

STA2601. Tutorial letter 105/2/2018. Applied Statistics II. Semester 2. Department of Statistics STA2601/105/2/2018 TRIAL EXAMINATION PAPER STA2601/105/2/2018 Tutorial letter 105/2/2018 Applied Statistics II STA2601 Semester 2 Department of Statistics TRIAL EXAMINATION PAPER Define tomorrow. university of south africa Dear Student Congratulations

More information

Forecasting Initial Public Offering Pricing Using Particle Swarm Optimization (PSO) Algorithm and Support Vector Machine (SVM) In Iran

Forecasting Initial Public Offering Pricing Using Particle Swarm Optimization (PSO) Algorithm and Support Vector Machine (SVM) In Iran Forecasting Initial Public Offering Pricing Using Particle Swarm Optimization (PSO) Algorithm and Support Vector Machine (SVM) In Iran Shaho Heidari Gandoman (Corresponding author) Department of Accounting,

More information

Tests for Two Means in a Multicenter Randomized Design

Tests for Two Means in a Multicenter Randomized Design Chapter 481 Tests for Two Means in a Multicenter Randomized Design Introduction In a multicenter design with a continuous outcome, a number of centers (e.g. hospitals or clinics) are selected at random

More information

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18, ISSN

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18,   ISSN Volume XII, Issue II, Feb. 18, www.ijcea.com ISSN 31-3469 AN INVESTIGATION OF FINANCIAL TIME SERIES PREDICTION USING BACK PROPAGATION NEURAL NETWORKS K. Jayanthi, Dr. K. Suresh 1 Department of Computer

More information

DATA GAPS AND NON-CONFORMITIES

DATA GAPS AND NON-CONFORMITIES 17-09-2013 - COMPLIANCE FORUM - TASK FORCE MONITORING - FINAL VERSION WORKING PAPER ON DATA GAPS AND NON-CONFORMITIES Content 1. INTRODUCTION... 3 2. REQUIREMENTS BY THE MRR... 3 3. TYPICAL SITUATIONS...

More information

Monotonically Constrained Bayesian Additive Regression Trees

Monotonically Constrained Bayesian Additive Regression Trees Constrained Bayesian Additive Regression Trees Robert McCulloch University of Chicago, Booth School of Business Joint with: Hugh Chipman (Acadia), Ed George (UPenn, Wharton), Tom Shively (U Texas, McCombs)

More information

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.)

Chapter 6. y y. Standardizing with z-scores. Standardizing with z-scores (cont.) Starter Ch. 6: A z-score Analysis Starter Ch. 6 Your Statistics teacher has announced that the lower of your two tests will be dropped. You got a 90 on test 1 and an 85 on test 2. You re all set to drop

More information

How to Consider Risk Demystifying Monte Carlo Risk Analysis

How to Consider Risk Demystifying Monte Carlo Risk Analysis How to Consider Risk Demystifying Monte Carlo Risk Analysis James W. Richardson Regents Professor Senior Faculty Fellow Co-Director, Agricultural and Food Policy Center Department of Agricultural Economics

More information

THE F FILES. Group benefits fraud what you need to know to fight fraud GET #FRAUDSMART

THE F FILES. Group benefits fraud what you need to know to fight fraud GET #FRAUDSMART THE F FILES Group benefits fraud what you need to know to fight fraud GET #FRAUDSMART SPRING 2018 LOOKING INTO THE FUTURE OF FRAUD WITH PREDICTIVE ANALYTICS Big data it is fundamental in the fight against

More information