[session] 使用开源人工智能和机器学习工具训练现实世界的信用模型
讲师:Michael Li (The Data Incubator)
14:50–15:30 Friday, 2017-07-14
数据科学&高级分析 (Data science & advanced analytics)
地点: 多功能厅8A+8B(Function Room 8A+8B)
观众水平 (Level): Beginner
必要预备知识
Familiarity with basic concepts of data science. We will demonstrate technical tools but the talk should be comprehensible to non-technical people
您将学到什么
- See a real-life credit model be trained over real-world data over the course of the talk - Understand how to use open-source machine-learning and artificial-intelligence tools for credit modelling and default prediction - Demonstration of Python tools including numpy, scipy, pandas, statsmodels, and scikit-learn to illustrate basic usage as well as industry best practices - Demonstration of important techniques including neural networks, natural language processing, time series, processing structured and unstructured data, and supervised and unsupervised learning - Discussion of theoretical underpinnings as well as practical issues like productionizability and scalability
描述
在这个演讲中,我们将使用100%的开源机器学习和人工智能工具,迭代地训练和完善基于真实贷款业绩数据进行贷款违约预测的简单而强大的信贷模型。数据是基于在10年里发放的26亿美元的贷款。 我们将使用诸如numpy、scipy、pandas、statsmodels和scikit-learning之类的Python工具来演示基本用法以及行业最佳实践。 该讲座将涵盖神经网络、自然语言处理、时间序列、处理结构化和非结构化数据以及监督和无监督学习等技术,并会讨论相应的理论基础以及例如可生产化和可扩展性等实际使用中的问题。
讲师介绍:
Michael Li (The Data Incubator)
Tianhui Michael Li is the founder and CEO of the Data Incubator. Michael has worked as a data scientist lead at Foursquare, a quant at D.E. Shaw and JPMorgan, and a rocket scientist at NASA. At Foursquare, Michael discovered that his favorite part of the job was teaching and mentoring smart people about data science. He decided to build a startup that lets him focus on what he really loves. He did his PhD at Princeton as a Hertz fellow and read Part III Maths at Cambridge as a Marshall scholar.
Strata Data Conference北京站大会7月12号即将召开——
有需求的同学还请抓紧时间,
点击二维码即可登录会议官网报名。