--- tags: data-science computational-methods machine-learning --- # Data Science & Cloud Computing ## What is Data science See how big companies define and introduce data science: - Amazon AWS: https://aws.amazon.com/what-is/data-science/ - IBM: https://www.ibm.com/topics/data-science - NIH (National Library of Medicine): https://www.nnlm.gov/guides/data-glossary/data-science ## What is the process of Data Science The data science lifecycle starts with the business or research understanding of the problem of interest, and proceeds through an iterative and cyclic process of accessing the data, deploying the models, updating those models, and returning back to our understanding of the problem. CPU-GPU Source: https://learn.microsoft.com/en-us/azure/architecture/data-science-process/lifecycle ## What is the difference: Data science vs. machine learning: CPU-GPU Source: https://www.coursera.org/articles/data-science-vs-machine-learning ## What is Clound Computing in Data Science? CPU-GPU Source: https://www.qlik.com/us/cloud-analytics ## Glossary of AI, ML, Data Science and Cloud Computing: To learn more about these tools, click below: https://cloud.google.com/discover?hl=en ## Databricks and Apache Spark For an example of how to use data science tools such as Databricks and Apache Spark, please see these training materials below: https://tufts.box.com/v/DatabricksSparkSQLWorkshop ## Resources at Tufts Please see the link below for full screen access: https://sites.tufts.edu/datalab/learning-statistics/stats-online-tutorials/