Data Programming

DAT 303
Closed
McMaster University Continuing Education
Hamilton, Ontario, Canada
Instructor
(14)
6
Timeline
  • May 13, 2024
    Experience start
  • May 25, 2024
    Project Scope Meeting (TBD)
  • June 29, 2024
    Midway Check-in (TBD)
  • August 17, 2024
    Final Presentation (TBD)
  • August 24, 2024
    Experience end
Experience
3/3 project matches
Dates set by experience
Preferred companies
Anywhere
Any company type
Any industries
Categories
Data visualization Data analysis Data modelling Data science
Skills
presentations elastic (elk) stack adult education mongodb logstash elasticsearch document-oriented databases computer science scalability data analysis
Student goals and capabilities

This course is part of the Data Analytics certificate program. Students in the program are adult learners with a post-secondary degree/diploma in computer science, engineering, business, etc.This course is designed to present the fundamental concepts and theories in Data Analytics and promote the application to the workplace and professional practice. 


Students begin with an exploration of MongoDB which is a document database with scalability and flexibility for queries and indexing, and progress to the ELK stack – a technology stack used for logging with different components, such as Elasticsearch, Logstash, and Kibana.


Course activities will include instructor presentations, required readings and experiential 

learning activities (i.e. case studies, group discussions, projects, etc.).


Students
Continuing Education
Any level
20 students
Project
40 hours per student
Students self-assign
Teams of 2
Expected outcomes and deliverables

The final project deliverables will include:

  • A report on students’ findings and details of the problem presented
  • Future collaboration ideas will be identified based on current project outcomes
Project timeline
  • May 13, 2024
    Experience start
  • May 25, 2024
    Project Scope Meeting (TBD)
  • June 29, 2024
    Midway Check-in (TBD)
  • August 17, 2024
    Final Presentation (TBD)
  • August 24, 2024
    Experience end
Project Examples

The project provides an opportunity for businesses and learners to collaborate to identify and translate a real business problem into an analytics problem. 

The projects, which can be short, will allow the student to apply the skills acquired on the various tools to address the business problem. Students also learn how to implement real-time scenarios. A review of different Cloud providers will also be covered.

Some examples of potential projects:

  • Identify the difference between No SQL and RDBMS
  • Deploy and manage Elasticsearch clusters
  • Develop search and analytics solutions
  • Develop scalable and cost-efficient applications with MongoDB
  • Build MongoDB data models for enterprise applications
  • Implement Logstash as a log management tool
  • Use Kibana together with Elasticsearch for visualizations

You should submit a high-level proposal/business problem statement including relevant data sets and definitions, a list of acceptable tools (if applicable), and expected deliverables. Business datasets could be provided based on a non-disclosure agreement or in an anonymized/synthetic data format that is relevant to your organization and business problem. The course instructors will review the documents to confirm the scope and timing of the proposed problem and its alignment with the capstone course requirements.


Analytics solution may be applicable for (however they are not limited to) the following topics:

  1. Demand for social services (healthcare, emergency services, infrastructure, etc.)
  2. Customer acquisition and retention
  3. Merchandising for trade areas (categories)
  4. Quantifying Customer Lifetime Value
  5. Determining media consumption (mass vs digital)
  6. Cross-sell and upsell opportunities
  7. Develop high propensity target markets
  8. Customer segmentation (behavioral or transactional)
  9. New Product/Product line development
  10. Market Basket Analysis to understand which items are often purchased together
  11. Ranking markets by potential revenue
  12. Consumer personification

To ensure students’ learning objectives are achieved, we recommend that the datasets are at least 20,000+ rows in size. Data need to be ‘clean’. If more than one database is provided, which must be conjoined, students will be required to integrate them. This supports the learning experience and minimizes partner data preparation.


Companies must answer the following questions to submit a match request to this experience:

Be available for a quick phone call with the organizer to initiate your relationship and confirm your scope is an appropriate fit for the experience. Advise the instructor if students will be required to sign an NDA prior to beginning the project.

What's your dataset size? Please note that ideally the datasets should be at least 20,000+ rows in size.

Share feedback and recommendations about the project deliverables with the students and instructor.

Provide a dedicated contact who will be available to answer periodic emails or phone calls over the duration of the project to address student’s questions or provide additional information. Minimum of 2-4 interactions with each student group leader (approximately 4-6 hours over the duration of the project). Let the students/instructor know if you will be away for an extended time (e.g., vacation).