We educate, promote and support the research, development and use of open source Python software in Washington, D.C. and metropolitan Maryland and Virginia.
Join us Tuesday, March 10th at the new iStrategyLabs for Hadoop with Python!
Donald Miner will give a quick introduction to Apache Hadoop, then discuss the different ways Python can be used to get the job done in Hadoop. This includes writing MapReduce jobs in Python in several ways, interacting with HBase, writing custom behavior in Pig and Hive, interacting with the Hadoop Distributed File System, using Spark, and integrating with other corners of the Hadoop ecosystem. The state of Python with Hadoop is far from stable, so we'll spend some time talking honestly about the state of these open source projects and what's missing.
Donald Miner is an avid user of Apache Hadoop, a practitioner of data science, and a long-time Python user. He serves as Chief Technology Officer at ClearEdge IT Solutions, a company that provides Big Data professional services. He is the author of the O’Reilly book MapReduce Design Patterns, which is based on his experiences as a MapReduce developer. Donald has architected and implemented a number of mission-critical and large-scale Hadoop systems within the U.S. Government and Fortune 500 companies. He received his PhD in Computer Science from the University of Maryland, Baltimore County, where he focused on Machine Learning and Multi-Agent Systems. He lives in Maryland with his wife and two young sons.
Welcome to DCPython's new home on the interwebs.
It's been just about a month since we asked you all to beta test the new website. And during that month we've been busy. We'd like to thank the 200 people who took a look and gave us feedback, as well as the four awesome members who made pull requests to fix bugs and add features.