Orbitz Worldwide (NYSE:OWW) is composed of a global portfolio of online consumer travel brands including Orbitz, CheapTickets, The Away Network, ebookers and HotelClub. Additionally, the company operates business-to-business services: Orbitz Worldwide Distribution provides hotel booking capabilities to third parties such as Amtrak, Delta, LAN, KLM, Air France and a number of other leading airlines, and Orbitz for Business provides corporate travel services to a number of Fortune 100 clients. The Orbitz Worldwide sites process millions of searches and transactions every day, which, not surprisingly, results in hundreds of gigabytes of log data per day.
Some of the challenges we face in analyzing these large amounts of data are: handling sparse, multi-sourced data; performing multidimensional analysis on both online and offline data; developing a data-driven culture; and working in a "centralized decentralization" model. This talk will detail how the Web Analytics team leverages big data technologies like Hadoop to meet these challenges. We’ll discuss how we use tools like Hadoop, Hive, Pig, and Sqoop to power our web analytics framework. We’ll also discuss how big data, along with an agile organizational structure, provides an immense opportunity to build a platform that can drive the business to the next level.