Lab 2 Amazon Web Services (AWS) Setup Instructions
DSAN 6000 Fall 2025
Make sure that you are connected to the Saxanet WiFi network and not the GuestNet network. SSH (TCP port 22) is blocked on GuesNet which means if you are on GuesNet you wil not be able to connect to your cloud VMs or clone repos from GitHub via SSH. Use SaxaNet. #
Follow these instructions step-by-step to setup your AWS environment. The screenshots may look a bit different than what you are seeing, but the flow is the same.
You can click on any screenshot on this page to see its full size version.
Task 1: Accept the Invitation to join the AWS Classroom (the first time)
In your student email inbox, you will have an email from AWS Academy with the subject Course Invitation.
Open the email. Click on the Get Started button as shown in the screenshot below. #
Click on the Create My Account button to create a new Canvas Account (note that this canvas account is different from your existing Georgetown canvas account). #
You should now be logged into AWS Academy Learners Lab and seeing a screen like the one shown below. Click on Learners Lab #
Scroll all the way to the bottom of the page and accept the Terms & Conditions. #
You are now being logged in into the AWS Console. Notice the ⬤ adjacent to the word AWS towards the top left of the page. This indicates that the lab (i.e. the AWS resources) have not yet started. Click on the play button along side Start Lab on the to top right corner of the page to start the lab. #
#
Each lab session that you start is at most 4 hours and you can see a timer showing the remaining time (hh:mm) on the ribbon along with the remaining budget out of the $50 allocated to each account. #
This environment is long-lived. When the session timer runs to 0:00, the session will end, but any data and resources that you created in the AWS account will be retained. If you later launch a new session (for example, the next day), you will find that your work is still in the lab environment. Running EC2 instances will be stopped and then automatically restarted the next time you start a session. SageMaker notebook instances will be stopped, but not restarted the next time you start a session.
Monitor your lab budget in the lab interface above. Whenever you have an active lab session, the latest known remaining budget information will display at the top of this screen. This data comes from AWS Budgets which typically updates every 8 to 12 hours. Therefore the remaining budget that you see may not reflect your most recent account activity. If you exceed your lab budget your lab account will be disabled and all progress and resources will be lost. Therefore, it is important for you to manage your spending. Read about how to preserve your budget.
Login into the AWS Console
The AWS Console is your entry point into the AWS cloud.
A new tab will open in your browser, this is the AWS Console. #
Note the URL in your browser’s address bar, it will start with the name of the AWS region (such as us-east-1) in which your cloud resources are hosted.
Note the username on the top right hand corner, this is your Federated Identity. Also note that the you did not have to provide any credentials (username/password) to login into the AWS console. How did this happen? #
Logging into the AWS Console at a later time
To access the AWS Console in the future, login to https://www.awsacademy.com/LMS_Login, go to Learner Lab -> Modules -> Start Lab.
If you already had an AWS account prior to logging into AWS Academy you would need to login into the AWS Educate AWS account via an Incognito Browser Window.
Task 2: Create a SageMaker Studio Domain and Notebook
The Amazon SageMaker Studio is one of the data science tools that we will be using during this course. We will set a SageMaker Studio domain and start a Jupyter Notebook as part of this task so that we can use it for future labs.
All screenshots on this page are clickable i.e. you can click on these to see an enlarged version of the screenshot for better clarity.
Pay special attention to portions of the screenshot highlighted in a red rectangle, they usually correspond to non-default options. If you miss any of these your SageMaker domain would not be created correctly and you may have to repeat the entire procedure.
Enter a name for your domain as shown above. Click on next
On the next page, do the following:
- Disable both settings under MLOps configuration.
- Under Canvas ready to use model configuration section, select “Use existing role” -> “Custom IAM role ARN” and in the edit box type the ARN of the LabRole role. The ARN of the LabRole is of the form
arn:aws:iam::your_account_id:role/LabRole
, copy paste this string in the Custom IAM role ARN box and then replaceyour_account_id
with the account id of your AWS account. You can find the account id of your AWS account in the top right corner of your browser window (it is highlighted in the red rectangle in this screenshot, enter it without the “-”, for example987733172745
).
Select Public Internet Access, follow the screenshot and click on next. Note that the subnet and security group ids for your account would be different from what you see in the screenshot. Select all the subnets in the drop down (we will discuss in class why this is required).
Change the
Default space size
to 50 from the default of 5 and click next.Scroll down and click on next. Your Domain will now be created.
It takes the domain sometime to get created and in use, though it says to not refresh the page, you can go ahead and refresh the page after a minute or two. You will see the screenshot below stating that the domain is in use.
Part-3: Creating the User
Now that you have created the domain, It is time to create a user in that domain. Easiest way to do this is to navigate to the Studio page on the sidebar and Clicking “Create user profile”
Name your user, as a matter of convention set the username as your NET ID. Click on next.
- Follow the screenshot.
- Disable all canvas permissions and Paste the ARN of your LabRole. Scroll Down
Launching Sagemaker Studio
- Click on Domains on the sidebar. Select your domain and click on User Profiles.
- Select launch and Click on Studio
This is called a managed service, you get to work on the notebook without having to deal with the creation steps for the underlying infrastructure. You can easily change this configuration as per the requirements of the analytics/machine learning job you are running.
- This should open up
Task 8: Use SageMaker Studio Notebook for analyzing a simple dataset
We will analyze the a simple dataset using our freshly created SageMaker Studio environment.
The URL of the git repo would be of the format shown in the screenshot below.
Once the repo is cloned, double click on the repo folder and then on the analytics.ipynb file in the repo. This will open the Jupyter notebook in a SageMaker Studio Notebook.
The notebook downloads the dataset from your GitHub and has code to create an S3 bucket in your account and copy the dataset there. There are some tasks to be completed in the noteobok. Complete tasks marked in the notebook and checkin the completed notebook in the git repo.
Shutting down SageMaker Spaces
It is important to shutdown SageMaker Studio when not in use so that you do not get billed for it when you are not using it.
At the end of this lab:
Make sure you shutdown SageMaker Studio.
Make sure you shutdown EC2 VM.
Make sure you have ended the lab in AWS Educate by pressing the End Lab button.
(Optional) Using VSCode Instead of Jupyter Notebooks
SageMaker Studio Has added support for VSCode and the steps to run it are below:
Navigate back to Sagemaker Studio, Select Code Editor, Click on Create Code Editor Space
Now select the instance type you want to use. Click on Run Space. Wait for a while then click on Open Code Editor.
- Now you can start using VS Code as you do on your local machine.
This service also has to be shutdown as jupyter notebook. Since you only have 50\(, not shutting a notebook down, can cost upto 20\) a day for instances such as ml.g4dn.xlarge
. It is very important to shutdown the space after working on it. If you spend 50$ on this account, YOU WILL LOSE ALL DATA. NO, you cannot recover the data after the account is shutdown. Therefore it is recommended to make regular commits to github as you work on your assignments or labs.