Skip to main content

Configuring a high-performance cluster, with virtual machines, to simulate Hadoop multi-node system for Data Science experiences

Submission Number: 134
Submission ID: 237
Submission UUID: 56e75a52-dd01-49dd-bb82-8616ea97d9f3
Submission URI: /form/project

Created: Thu, 01/13/2022 - 11:07
Completed: Thu, 01/13/2022 - 11:07
Changed: Wed, 07/06/2022 - 15:09

Remote IP address: 192.112.102.251
Submitted by: Gerald Kruse
Language: English

Is draft: No
Webform: Project
Configuring a high-performance cluster, with virtual machines, to simulate Hadoop multi-node system for Data Science experiences
CAREERS
{Empty}
cluster-management (495), hadoop (12), software-installation (211), unix-environment (60)
Halted

Project Leader

Gerald Kruse
814-644-9206
814-641-3595

Project Personnel

{Empty}
{Empty}
{Empty}

Project Information

Our Data Science high-performance cluster was delivered in Jan 2020. It is a Cloudseek 1000 from PSSCLabs.
Unfortunately, Covid impacted our efforts to configure it for our Data Science courses (https://www.juniata.edu/academics/departments/data-science/curriculum.php). At Juniata, we offer a Major (our "Program of Emphasis"), a minor (our "Secondary Emphasis"), and an online graduate degree in Data Science. We've been able to get by, but with a Big Data course coming available, we need to configure this system. We would like funding for one of our students to work on this project. We have the name of a possible technical mentor, or at least someone who will need to be consulted.
It's been a challenge to get this cluster operational, and we would really appreciate any assistance.

Project Information Subsection

{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
CR-Penn State
{Empty}
No
Already behind3Start date is flexible
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}

Final Report

{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}