Title:Apache Big Data Stack (Summer: 2014)
Mentors: Scott McCaulay
Keywords: Apache Big Data Stack, Chef, FutureGrid, Big Data
Abstract: As the amount of data generated around the world continues to accelerate by the second, the more the term Big Data finds its way into scientific conversation. Because of this tremendous surge, it has become imperative that such mass data use “computing power and space” for it to be processed, analyzed, and serve other purposes [1]. Hence, in order to meet head-on the enormous challenges rendered by Big Data, open source software from the Apache Foundation is evaluated as a “Big Data Stack” to support scientific computing. The approach to handling the complications surrounding Big Data involve installing and testing as many open-source software packages from the Apache Big Data Stack as possible on FutureGrid machines and later making those packages accessible utilizing Chef. The packages will be built into projects and from that point Chef will be used to transform the infrastructure of each project’s code, making it agile and accessible through a network of servers [2]. Essentially, this research will demonstrate how the Apache Big Data Stack can be used and applied to solve complex problems regarding Big Data.
Title:Configuring and Customizing the HUBzero Experience (Academic School Year: 2013-2014)
Team Members: Antonio Guion, Tatyana Matthews, Nigel Pugh
Mentors: Je'aime Powell, Justin Deloatch
Keywords: HUBzero, gateway user experience
Abstract: HUBzero is an open source software package used to construct websites for scientific research and educational activities. HUBzero was originally created by researchers at Purdue University in conjunction with the National Science Foundation (NSF) who sponsored the Network for Computational Nanotechnology to support nanoHUB.org. The HUBzero platform currently supports over 40 hubs across a variety of disciplines, including cancer research, biofuels, climate modeling, water quality, education, and more.
The team investigated how HUBzero features are utilized for research, education, and scientific collaboration. The project involved configuring and customizing the user experience on a new hub. The team also learned how to work with simulation workspaces, plus the process of allowing groups to collaborate. Finally, the team learned how to publish the hub so that it could be viewed publicly and how to use the new database component.
To accomplish this, the HUBzero team members used data collected by the 2013 Research Experience for Undergraduates Pasquotank River Watershed Team who completed tests of five tributaries and the river itself. Streams tested were Newbegun Creek, Knobbs Creek, Areneuse Creek, Mill Dam Creek, and Sawyers Creek. The team uploaded test data to a database to determine how HUBzero handles databases. |