ACTL3142 & ACTL5110 Statistical Machine Learning for Risk Applications
Tutors: Mohammad Hossein Nezhadhaghighi, Nathan Tan
What will I get out of taking this course?
Source: DALL-E generated image.
The schedule for each week’s topics are:
Regression theory
Machine learning
Lecture Outline
Learning activities
Assessment
Past Student Feedback
The learning activities of this course involve the following (besides additional self-revision):
At https://unsw-risk-and-actuarial-studies.github.io/ACTL3142/
Also available by https://laub.au/ml
James, G., Witten, D., Hastie, T., Tibshirani, R., An Introduction to Statistical Learning with Applications in R, Springer, 2nd version, 2021
Check the course website for the reading list. Currently it starts with:
Week | Required | Optional |
---|---|---|
1 | James et al. (2021): Chapter 1, Chapter 2, Chapter 3 up to and including 3.1.1 Estimating the coefficients | - |
2 | James et al. (2021): Rest of Chapter 3, Chapter 6.1 | - |
3 | James et al. (2021): Chapters 4.1, 4.2, 4.3, 4.6, 4.7.1, 4.7.2, 4.7.6, 4.7.7 | De Jong & Heller (2008): Chapter 4 |
4 | De Jong & Heller (2008): Chapters 5.1, 5.2, 5.3, 5.4 | De Jong & Heller (2008): Chapter 5.5 |
5 | De Jong & Heller (2008): Chapters 5.6, 5.7, 5.9, 5.11 | De Jong & Heller (2008): Chapter 5.10; @haberman1996generalized |
The live “face to face” lecture will be 2 hours each week (not 3), Monday 9-11am. Where needed, we will provide extra lecture recordings (whenever 2h of in-class time is not enough to cover all content). We will use the scheduled Tuesday 1h lecture as a consultation period.
They are on the course website.
Have a go at them yourself, after the lecture but before your tutorial.
The forum will be the course’s primary mode of communication. Questions about the theory, the running of the course, or the assignments should be asked on the Ed forum. You have to open the Ed forum at least once in order to be enrolled and thus to get any announcements. Don’t ask personal questions on the forum. Email those questions to the head tutor who may escalate them to the lecturers.
The benefits of asking questions on our Ed forum:
Before asking a question on the forum, try the following steps:
We only ask you to try first, as we don’t want the forum to become your crutch. After all, the forum won’t be available in the final exam. If you find someone else’s question that you know the answer to, please try to answer it directly. As mentioned before, the forum is designed to be a community and safe space. Please contribute to our community. Plus, you will learn the content very well if you can solve someone else’s problem on the Ed forum!
Lecture Outline
Learning activities
Assessment
Past Student Feedback
The formative assessments are designed to encourage you to engage with the course content and start experimenting early, without the fear of needing to be 100% correct. You will complete them by submitting posts to specific Moodle forums.
There are two steps for each StoryWall:
Read the full instructions on this part of the ACTL3142 Moodle page or on the ACTL5110 Moodle page.
Both components must be completed to get the grade (and there is no half-grades). Take specific note of attempting every component and giving specific feedback. Missing a component or giving non-specific feedback, even if the rest of your submission was very high quality, will result in a 0.
The comments from your peers will be the primary source of feedback you will receive on these formative submissions. With that in mind, you should give imagine the kind of feedback you wish to receive from your peers and give that level of quality feedback to someone else.
There are 5 StoryWall tasks, each worth 3% each. The best 4 of 5 being counted, maxing out at 10%. They are due on Friday at 11:55 am in Weeks 3, 5, 7, 9, 10.
You should realistically aim to get 10/10 marks for this…
Due Week 5 Friday 11:55 am
This will be a data science task using the techniques you learn from the regression theory half of the course.
Due Week 9 Friday 11:55 am
This follows on from the previous task using the techniques you learn from the machine learning half of the course.
All due dates are at Friday 11:55 am of the following weeks:
If submitting late, you must apply for special considerations through UNSW central system. If you ask us for an extension, we will refer you to the special considerations system.
Without special consideration, late StoryWalls will not be marked. I have noticed that special considerations will not be granted for StoryWall tasks if you can still get full marks without that task.
For the project, the general policy is:
Late submission will incur a penalty of 5% per day or part thereof (including weekends) from the due date and time. An assessment will not be accepted after 5 days (120 hours) of the original deadline unless special consideration has been approved.
Report Part 2 (worth 21% course grade) is due Week 9 Friday 11:55 am.
If you submit without special consideration on:
E.g. a submission on Saturday 11:56 am (10% penalty) which was graded as 80/100, would be recorded as 72/100, and hence an overall course grade of 15.12% out of the maximum 21%.
However, as a special case just for Project Report Part 1, we will not apply the 5% per day penalty for the first 72 hours after the deadline.
Report Part 1 is due Week 5 Friday 11:55 am.
If you submit without special consideration on:
Do not send or show your work to another student. You will be penalised along with them!
If you use something directly from an external resource, cite the source.
You will add a “Generative AI usage” appendix to your reports, detailing how you used AI (what outputs, what prompts).
If you do not use AI, then you will still need the appendix that says that.
Source: DALL-E generated image.
We have many years of past exams for you to practice on, though the last two exams are the most appropriate for the current edition of the course.
Source: DALL-E generated image.
Lecture Outline
Learning activities
Assessment
Past Student Feedback
What were the best things about this course?
“This was one of the best ACTL courses I have ever taken. It has stimulated my interest in data analytics and modelling and I am so glad that the course is structured the way it is. Please do not change anything about the course or the lecturers, they were amazing.” (2023 T2)
“the textbook readings were super helpful in getting us to read the concepts in more detail” (2023 T2)
“There were a few things I really loved about this course. The course content wasn’t too mathematics focused. It was nice to take a break from all the hardcore calculus. This my first ever machine learning course and it was very cool to see how machine learning works.” (2023 T2)
“The practical application of techniques on real data was stimulating.” (2023 T2)
“The content was really interesting and very practical. Really sparked my interest in data science and modelling in general.” (2023 T2)
“Refreshing focus on the theoretical and application aspects and constructions of models rather than mathematical/numerical focus.” (2023 T2)
“I really enjoyed practicality of the course, where i felt it is something I could genuinely use in the workforce.” (2024 T1)
“Reflection: This course has been great! Personally the ACTL3142 course introduced me to statistical machine learning and data analysis techniques used in actuarial work. However, as I delved deeper into the course, I found myself more drawn to the underlying mechanisms of these predictive models and statistical learning techniques. I became more interested in the ‘how’ rather than the ‘what’. This curiosity led me to explore the field of computer science, which forms the backbone of these data analysis techniques.This curiosity led me to explore computer science, which provides a deeper understanding of these tools. This shift in interest from application to creation is why I’m considering changing my degree from actuarial to computer science!” (2023 T2)
What could be improved?
“Everything. ACTL courses is meant to be all about applying mathematical theory, we arent compsci people who writes 500+ lines of code per assignment. We are not enrolled in compsci degrees for a reason but now we are forced to do a half theory half compsci course. The theory isnt math heavy either, its more about learning and interpreting models, instead of performing calculations. Even if you want us to do heavy assignments like this one, at least give us R tutorials which cover all the relevant code that we need???? We have compsci’s heavy assignment but no spoon feeding code like compsci lectures at all????? Also, 30% for the assignment is way too much, we want the 70% finals instead of 60%.” (2023 T2)
This course will require you to complete a substantial amount of programming. It is not a computer science course by any measure, but for some students (depending on your background) this will be quite a big step up in the size and difficulty of the coding tasks.
Statistical/machine learning coding is a very enjoyable component of your whole degree, though simultaneously it can be one of the most frustrating parts. There is no way to progress in your skills without simply making a lot of mistakes and learning from them. We recommend that you view this course as a safe environment to learn these skills, and potentially make some mistakes now, so that you have the capability and the confidence to perform make informed decisions later in your career based on rigorous understanding of the data.
Your future career will contain a sizeable data science / statistical programming element. This is a key part of your chosen profession, and in fact, if you complete this course and realise that you are truly averse to any coding, then it perhaps worthwhile to reflect on your career path now rather than have expectations which are unsatisfied in the future.
Is there anything you wish you could have told your former self before starting to help them be prepared to learn these topics?
“As a college returner, I’ve found the course immensely valuable. It has opened my eyes to the field of machine learning, and I’ve garnered many ideas while completing the assignments. However, lacking any background in R programming has made adapting to the lab materials quite challenging. If I could offer advice to my past self regarding course preparation, I would suggest learning R first.” (2023 T2)
“If I could have told myself one thing before starting this course, it would have definitely been to manage my time effectively and spend more time on exploring further questions in coding through R as I particularly struggled to manage my workload when working on the assignment and would have benefitted if I started it earlier or had a much better understanding of coding in R.” (2023 T2)
“Something i wish i could have told myself at the start of the term was to truly learn and understand the R codes behind modelling and what each function means.” (2023 T2)
“If I could advise my past self on something, I would say brush up on R before beginning the course, and get better at using if functions mainly.” (2023 T2)
“If I were to go back in time and tell myself one thing, I would definitely tell myself to not underestimate the time it takes to code these models, as they took significantly longer than I expected - having a deep understanding of how these models work is important, yes, but so is learning how to code them so they run quickly and efficiently in R.” (2023 T2)
“If there was anything I wish I could have told my former self before starting, it would be to start studying and try to stay as up to date as possible as I believed that due to being behind, I missed out on some details that I should have incorporated into the assignment and would cost me marks. Also, I would’ve told myself to learn how to write a report properly.” (2023 T2)
“Furthermore, I would have told myself to keep more on top of the course readings as I had missed some throughout the term. Additionally, I could have spent more time on some content I found difficult such as the unsupervised learning content.” (2023 T2)
Source: Wikipedia, Rubber duck debugging