ADMI2011
ADMI2011 ADMI CERSER Community Grids Lab CReSIS IU ECSU  
 

ADMI 2011 Cloudy View of Computing Workshop
Introduction

Cloud computing provides elastic compute and storage resources to solve data intensive scienceand engineering problems, but the number of students from under-represented universities who are involved and exposed to this area is minimal. In order to attract underserved students, we trained faculty members from the Association of Computer/Information Sciences and Engineering Departments at Minority Institutions (ADMI) in the area of cloud computing through a one-week workshop conducted on the campus of Elizabeth City State University. This workshop enabled faculty members from underserved institutions, who are involved with minority undergraduate students to gain information about various aspects of cloud computing while serving as a catalyst in propagating their knowledge to their students.

ADMI-Cloud WorkshopWorkshop Development
The participants of the ADMI faculty workshop have diverse backgrounds; so, to better serve them in the area of cloud computing for data-intensive applications, a preliminary discussion was hosted at the ADMI conference during April 14-16, 2011 before an in depth one-week session in June; this provided participants with an understanding of several practical applications, and it scoped the participant’s relevance/expertise domain. The desired competencies for faulty members to acquire and/or refine in cloud computing were:

Understand and articulate the challenges associated with distributed solutions to large-scale problems, e.g., scheduling, load balancing, fault tolerance, memory and bandwidth limitations, etc.

  • Understand and explain the concepts behind MapReduce
  • Understand and express well-known algorithms in the MapReduce framework.
  • Understand and reason about engineering tradeoffs in alternative approaches to processing large datasets.
  • Understand how current solutions to the particular research problem can be cast into the MapReduce framework.
  • Explain the advantages in using a MapReduce framework over existing approaches.
  • Articulate how adopting the MapReduce framework can potentially lead to advances in the state of the art by enabling processing not possible before.

Hands-on Workshop
The hands-on workshop was June 6-10, 2011. Participants were immersed in a “MapReduce boot camp”, where ADMI faulty members sought introduction to the MapReduce programming framework. The following were themes for five boot camp sessions:

  • Introduction to parallel and distributed processing
  • From functional programming to MapReduce and the Google File System (GFS)
  • “Hello World” MapReduce Lab
  • Graph Algorithms with MapReduce
  • Information Retrieval with MapReduce

An overview of parallel and distributed processing provided a transition into the abstractions of functional programming, which introduces the context of MapReduce along with its distributed file system. Lectures focused on specific case studies of MapReduce, such as graph analysis and information retrieval. The workshop concluded with a programming exercise (PageRank or All-Pairs problem) to ensure faculty members have a substantial knowledge of MapReduce concepts and the Twister/Hadoop API.

Source of Computing Resources
FutureGrid is an experimental testbed, which uses virtualization and provisioning of Infrastructure-as-a-Service to provide unique capabilities in deploying customized environments for experiments in grid and cloud computing. It has been leveraged to create a self-contained, flexible, plug-and-play educational "virtual appliance". Academic institutions with restrictive computational resources can use virtual appliances enabling students the opportunity to easily experiment with cloud technology. The virtual appliances support multiple virtualization technologies allowing them to run on a variety of resources, including user workstations and desktop grids. The “Cloudy View on Computing” workshop will highlight two specific educational virtual appliances detailing different middleware stacks used actively in clouds: Hadoop and Twister, each with varying data-intensive applications.

 
Participants
ADMI ADMI ADMI ADMI ADMI

DeShea Simon
Hampton University

Timothy Holston
Mississippi Valley State University

Mohammad Hasan
Elizabeth City State University

Constance Bland
Mississippi Valley State University
Candace Adams
Auburn University
         
ADMI ADMI ADMI ADMI ADMI
Felicia Doswell
Norfolk State University

Yenhung Hu
Hampton University

Willie Fuller
Norfolk State University
Natarajan Meghanathan
Jackson State University
Darnell Johnson
Elizabeth City State University
ADMI-Cloud Workshop
ADMI-Cloud Workshop
ADMI-Cloud Workshop
 
CReSIS  IU   CERSER   ECSU