My First CTF Challenge: Brute Forcing a Web Admin Page with Python
This post walks the reader through a fascinating process of investigation, discovery and solving the author’s first CTF challenge with Python!
Background
This past weekend I participated in a Capture The Flag (CTF) security event. CTFs are usually organized as educational competitions designed to give participants experience in various security challenges.
The CTF I signed up for was SarCTF. The competition was run “jeopardy style” where you could complete as many challenges as possible in 24 hours.
I was participating in this competition as a requirement of my coursework for my in-progress MS Cybersecurity degree.
From the syllabus:
You will be required to compete in at least one CTFTime-ranked CTF, and provide a writeup about at least one non-trivial problem that your team worked on.
I think this is a great assignment and gave our class an opportunity to flex our security muscles.
The Challenge
I wanted to focus on what I think I am good at so I chose to focus on the web challenges and proceeded to solve the “Admin” CTF challenge which was worth 982 points.
Challenge Modal
Below is a screenshot from the challenge website, as you can see there isn’t a whole lot of information to go on - which is what makes these exciting! :)
Challenge Web UI
The challenge redirects us to the /admin
path on a website that has a login form, a forgot password button and a navbar with various elements.
Investigation / Spelunking
Initially I tried to submit some basic SQL injection (SQLi) attacks but didn’t see results.
When it was clear it wasn’t a basic SQLi attack, without fully ruling it out, I wanted to see what other clues there were. The first tool I usually reach for is the Network Pane in Chrome, which can provide more information about the web page.
Below is a view of different resources the site loads, displayed in the Network Pane. Two resources in particular caught my eye:
The first one seems to be a JS script from the server, ambiguously named script.js
and the other is an XHR request being made to users.checkAuth
.
Looking into script.js
, we get to see this snippet near the top of the file (with truncations for brevity) which shows that the XHR request in question is executed on page load:
|
|
Further below, in another excerpt from script.js
, it seems this module functions as a “url router” where certain blocks of code are executed based on the path of the page.
In fact, it looks like the site is performing a lot of templating via JavaScript!
If you look more closely at the /admin
case
in this switch
statement, you can see that this is where the code block for the login form is coming from.
|
|
Further, from the snippet above, we see two more functions we can go through - adminAuth
and adminRestore
. These look important.
Discovery
If we look closely at the adminRestore
function (which is triggered by the Forgot your password?
button in the form):
|
|
Aha! We can see a while
loop based on a condition which triggers a prompt()
form allowing you to enter a six-digit code. Technically we can enter anything you like in the prompt()
, but lets trust the prompt text for now that it is indeed a six-digit code and try to brute-force this.
While the code speaks for itself, it is always useful to check the request-response cycle and see what happens when we trigger an event we are curious about. Luckily, the server we are trying to talk to uses HTTP, and not HTTPS, so we can snoop this traffic without an issue.
Most people recommend using some sort of proxy, but I am pretty comfortable with Wireshark/tcpdump so I clicked on Forgot your password?
and monitored what happened next:
On clicking the button we make an immediate post request with nocontent, which corresponds to api('admin.restore', undefined, true)
:
In the response we can see this JSON object in the data received which indicates we can continue requesting SMS code until we get it right!:
|
|
At this point its important to note to not make any assumptions about what the successful response data will be - the code has to accont for this ambiguity in the response. A good way to do that is to base the the probes on things that you do know about the problem.
Specifically in this case what we know is that a successful login will not have {"need_sms": true}
in the
response.
Solution - lets write some code!
If we break down the problem, we need to do two things while interacting with the service:
- Make a request to the
admin.restore
endpoint without any parameters and get two variables:need_sms
- this will betrue
until we hit the correct six-digit code.new_hash
- this is a “token” of sorts that we need for subsequent requests.
- Send the new
hash
we received with the next SMS code we want to try untilneed_sms
is not explicitlytrue
.
Before we can make requests, we need to be able to generate random SMS codes, which I would call a step 0.
Step 0: Generating SMS codes
Surprisingly, the first thing that came to mind was using something in the itertools
module.
I wanted various combinations for a set of numbers, and I found product()
to be up to the task.
I also use a generator below so if the operation terminates, I can restart the sequence from the termination point.
|
|
Step 1: Fetching the new hash
|
|
This block of code would retrieve the new_hash
to use with each request to try an SMS code.
Step 2: Using a new SMS code in the sequence to brute force this
|
|
We run this code with step 1, while iterating over the code in Step 0, and we’re done right?
Conceptually, yes! There were a few challenges along the way, especially since the address space turns out to be 1 million possilbe combinations. We will speak about this in the next section.
Run code and complete!
We see the program exits at 272273
. One can either get the output programmatically by printing out the response, or one can use the browser to key in the password and see what you get. This is what it looks like in the browser:
The programmatic output is shown in the next section.
Optimizations and challenges
On the first run of the script, it exited a couple of hours in due to the fact that we were overwhelming the server.
This is inferred based on the observation in the logs of response.status_code
of 502
instead of 200
as expected.
At the same time, a lot of time had passed and the combinations that had been tried numbered less than 100,000 which would make it difficult to actually complete the challenge in the time allotted for the CTF.
Two things were introduced to help optimize the attack:
argparse
(from the stdlib) to help specify a prefix while running the code, so it could be chunked into 10 independent datasets with 10 different python interpreters running (goroutines
might have been helpful here for easy concurrency..)requests
sessions andtenacity.retry
headers - taking no risks on the process exiting.
The running code with 10 python interpreters and the final flag looks like:
Conclusion & Learnings
- CTFs are really fun and a great way to get close to understand real world security vulnerabilities and the impact of exploits.
- As an application-focused SWE-SRE, it was clear this “vulnerability” could have been prevented with better systems design around the authentication system of an application. Good to think about when trying to secure applications in your own production environment - what else could be exploited?
- Do not expect applications, including CTFs to be resilient to dealing with high-scale. The CTF even started late due to other issues - its important to be considerate towards organizers who are also humans. As a practitioner, however, it is a good representative of how the real world might be and one should have resilient code which respectfully retries.
Overall, a great learning experience and I can’t thank my school enough for pushing me in this direction!
Appendix: Full source code
|
|