Srijith Rajamohan, Ph.D.

Publishing Jupyter Notebooks using Gatsby and Netlify

A quick overview

Posted on November 11, 2018

Reader level: Introductory Build a Gatsby website using the following command. This will start a server running at port 8000, navigate using your browser. You can also access the GraphQL query page at localhost:8000/___graphql. gatsby develop Once you are done developing, you can build this website so it can deployed to a server such as Netlify or Gitlab pages. gatsby build Once you have the above you can go ahead and set up your Netlify account and link your current folder. [Read More]

#Jupyter Notebook #Publishing #Gatsby #Netlify

SuperComputing18 Presentations

Slides

Posted on November 11, 2018

The slides below were used for presentations at the SuperComputing 2018 conference in Dallas.

Overview of PyTorch

The posts associated with these slides can be found here and here.

Quick introduction to AutoML

The post associated with these slides can be found here. Note that this is still work in progress and will be updated periodically.

#AutoML #PyTorch

AutoML

An overview of Automated Machine Learning

Posted on October 10, 2018

Reader level: Intermediate Disclaimer: This post is work in progress and will be updated periodically. This is not meant to a comprehensive overview of the topic, but more of an introduction to AutoML, some tools and techniques. Overview Finding a model that works for a specific problem or a class of problems can be a time-consuming task. Usually, an engineer or a scientist determines what model class to use either based on his prior knowledge of the problem at hand or by evaluating several models and picking the best one. [Read More]

#AutoML #Automated Machine Learning #Hyperparameter optimization #Bayesian

Gaussian Process Regression (Draft)

Uncertainty quantification

Posted on October 10, 2018

Reader level: Advanced Gaussian Distributions A Gaussian distribution exists over variables, i.e. the distribution explains how (relatively) frequently the values for those variables show up in observations. A Gaussian distribution for a n-dimensional vector variable is fully specified by a mean vector, μ, and covariance matrix Σ $$ \mathrm{x} = (x_{1},....x_{n})^{T} \sim \mathcal{N}(\mu,\Sigma) $$ A univariate Gaussian distribution is given by $$ p(x|\mu,\sigma^2) = \dfrac{1}{2\pi \sigma^2} e^{ \dfrac{ -(x - \mu)^2 }{2 \sigma^2} } $$ where μ is the mean and σ is the standard deviation for the Gaussian. [Read More]

#GP" #Uncertainty #Bayesian

Word2Vec in Pytorch - Continuous Bag of Words and Skipgrams

Pytorch implementation

Posted on September 9, 2018

Reader level: Intermediate Overview of Word Embeddings Word embeddings, in short, are numerical representations of text. They are represented as ‘n-dimensional’ vectors where the number of dimensions ‘n’ is determined on the corpus size and the expressiveness desired. The larger the size of your corpus, the larger you want ‘n’. A larger ‘n’ also allows you to capture more features in the embedding. However, a larger dimension involves a longer and more difficult optimization process so a sufficiently large ‘n’ is what you want to use, determining this size is often problem-specific. [Read More]

#NLP #Pytorch #GPU

CS4984/5984 Big Data Summarization

Class notes

Posted on September 9, 2018

Connecting to ARC machines Cascades The ARC cluster that will be used for this class is ‘Cascades’. Detailed instructions on how to access this machine can be found here. A quick overview of how to login and submit jobs is given below. To login: ssh username@cascades1.arc.vt.edu where username is your PID and your password is the VT PID password followed by a comma and the two-factor six-digit code. For e.g. the password looks like this: [Read More]

#NLP #Pytorch #GPU

Virtual environments for Anaconda Python

Useful conda commands

Posted on September 9, 2018

Installation using Conda To create a conda environment named ‘myenv’: conda create --name myenv To create an environment from a file ‘test.yml’: conda env create -f test.yml The environment name comes from the line ‘name: tag’ inside the ‘test.yml’ file. To create a named environment from a file ‘test.yml’: conda env create -f test.yml -n pytorch To create an environment from the base environment: conda create --name myenv --clone base To remove an environment named ‘envname’: [Read More]

#Python #Anaconda #Conda #Virtual Environment

OpenACC workshop

Class slides

Posted on July 7, 2018

Slides for the OpenACC class that I taught for Prof. Tim Warburton.

PEARC 2018 Workshop

Workshop slides and Jupyter Notebooks

Posted on July 7, 2018

This page contains the materials for the workshop ‘Introduction to Machine Learning’ which has been accepted to be presented at PEARC18. Participants would have access to a server running the relevant Python 3 installation along with the tools Tensorflow and Keras. If you would like to install your own environment, please check the bottom of this page to download an conda environment file that can be used for configuration. The Jupyter notebooks can be downloaded from here. [Read More]

Introduction to Machine Learning

NLI Class slides

Posted on April 4, 2018

This page contains slides to the ‘Introduction to Machine Learning’ NLI class series that I have taught at Virginia Tech.

#teaching #machine learning #scikit-learn #tensorflow #deep learning

About

Publishing Jupyter Notebooks using Gatsby and Netlify

A quick overview

SuperComputing18 Presentations

Slides

Overview of PyTorch

Quick introduction to AutoML

AutoML

An overview of Automated Machine Learning

Gaussian Process Regression (Draft)

Uncertainty quantification

Word2Vec in Pytorch - Continuous Bag of Words and Skipgrams

Pytorch implementation

CS4984/5984 Big Data Summarization

Class notes

Virtual environments for Anaconda Python

Useful conda commands

OpenACC workshop

Class slides

PEARC 2018 Workshop

Workshop slides and Jupyter Notebooks

Introduction to Machine Learning

NLI Class slides