Maelstrom #7: Static OpSec Review

Breaking down the Maelstrom DLL and Loader to identify and discuss remediations for indicators-of-compromise.

PreviousHome NextMaelstrom #6: Working with AMSI and ETW for Red and Blue

Last updated 9 months ago

Maelstrom #7: Static OpSec Review

Breaking down the Maelstrom DLL and Loader to identify and discuss remediations for indicators-of-compromise.

Introduction

In the previous two blogs (Maelstrom #6: Working with AMSI and ETW for Red and Blue and Maelstrom #5: EDR Kernel Callbacks, Hooks, and Call Stacks), we've discussed five key mechanisms which Windows and third-party Event Detection and Response (EDR) programs use to evaluate a C2's implant and intervene with its operation by detecting behaviours. These are relatively new techniques, and can be very effective at detecting implants which have not been seen before.

While these can be very sophisticated, especially at the bleeding edge where attackers and defenders alike continue to scour and delve ever deeper into Windows itself in search of new techniques, it can be surprising how an implant or executable can run without much of this effort. It can seem redundant to develop a standalone Portable Executable (PE) file when you can simply run a reverse shell with PowerShell, let alone spending hours trawling through WinAPI calls.

We often find ourselves arriving at the same questions: Why are freshly written executables based on StackOverflow answer's on "how to write a reverse shell", clearly malicious, sometimes not detected? Why are completely new techniques immediately detected by vendors when they are uploaded to VirusTotal? The answer to both essentially lies within the static and dynamic analyses of these files. EDRs and sandboxes will evaluate implants memory and behaviour over time. If your new implant looks like a prior implant, or behaves like a prior implant, to the EDR it's probably an implant. With the growth of tools such as VirusTotal, data sharing between EDRs, and a constantly growing corpus of techniques even a "brand new" implant may unknowingly contain indicators of compromise.

Over the next two blogs, we will look at ways that we can detect an implant using static and runtime analysis, and consider ways which these can be evaded.

In this blog, we will focus on the static review by analysing our proof-of-concept C2, Maelstrom. We will be looking at the implant's Portal Executable (PE) and Reflective DLL and see where they break operational security (OpSec), and how we can attempt to address this.

To achieve static review, we are going to look at the PE Structure and some automated tools for indicator-of-compromises. We will compare a PE without any meaningful opsec practices (labelled "unsafe"), and a PE with opsec practices (labelled "safe"), to illustrate the impact of good opsec.

Objectives

This post will cover:

Reviewing the loads and imports of the PE and DLL
- How we can examine these files
- Looking for suspicious attributes
- How we can find and evaluate imports, functions, and strings
Reviewing the capabilities of the PE and DLL
- Using CAPA to look up their functions and behaviours against the MITRE ATT&CK framework and other catalogues
- How fresh implants behave on platforms such as VirusTotal
- Briefly examining what attributes vendor and crowdsourced rules trigger on
Reviewing the metadata of the PE and DLL
- How unassuming metadata can be suspicious
- How entropy and Authenticode can impact detection
- Looking at automated detection tooling, namely Intezer

As ever, we will not be outright releasing bypasses for these techniques. The implants we have developed are purely illustrative, and as part of this blog we have uploaded the files to VirusTotal, as well as developing YARA rules which will detect the implant in operation which we will publish in the next blog. This blog is also by no means exhaustive, and there are naturally more advanced filters and behaviours in use in the wild.

Finally, if you fancy getting your hands on our code early, help yourself to our VirusTotal samples!

Important Concepts

Portable Executable (PE)

We have previously discussed Portable Executables (PE) throughout this series, but to quickly recap:

Dynamic Linked Library (DLL)

Similarly, we've previously discussed DLLs, but briefly, a DLL is also based on the Portable Executable (PE) file format. DLLs allow for functions to be exported, and then this can be loaded into an application by using the LoadLibraryA call, or by statically linking the library. The functions exported can be for anything, isolating functionality making your code more modular. This makes it far more simpler to load objects into memory without using complex workarounds within the exe itself.

Reviewing PE modules and functions

When we are looking at the PEB, we aiming to review the loaded modules, imported functions, and strings associated with the file overall. Within malware, these are common areas for both indicators of compromise and a suspicious absence of indicators of compromise.

To locate this information, we are going to use the following three programs:

Loaded Modules and Imported Functions

The first thing we want to assess is the modules required by the implant. The easiest way to do this is to use CFF Explorer. When installed, right click the PE and click 'Open with CFF Explorer':

EXE (Both)

As both the loaders, PE Files, have 0 imports, the Import Directory should be empty. When in CFF Explorer, navigating to Import Directory should show nothing:

However, it is worth remembering that a file with 0 imports is a pretty high indicator that the implant is malicious and Anti-Virus Vendors have known about this technique for a long-time. This is one of the many reasons why implants avoid touching disk, but we aren't teaching Red Team Tactics.

We can achieve the same thing in PE Bear by opening the file, and finding the 'Imports' tab:

Reflective DLL

Moving onto the actual implant, the Reflective DLL. This has no position-independent code and relies on imports.

So, if we open this with CFF Explorer:

We make use of all of these libraries. To see the functions, simply click one of the module's table row:

In the lower window, we can see which of the module's loaded functions were identified and where they are within the PE. Going through the four modules, we can see why they are present.

The WinHTTP DLL is there because of this link:

#pragma comment(lib, "winhttp")

And then ADVAPI32 because of:

if (!GetComputerNameA(lpComputerName, &nSize))
{
    return NULL;
}

if (!GetUserNameA(lpUserName, &nSize))
{
    return NULL;
}

Finally, MSVCRT because of the malloc and sprintf calls:

char* data = malloc(MAX_PATH * 5);

[..]

sprintf(data, "{ \"init\": {\"processname\": \"%s\", \"computername\": \"%s\", \"username\": \"%s\", \"dwpid\": \"%ld\"}}", lpProcessName, lpComputerName, lpUserName, dwPid);

To get a list of 'blacklisted' functions and libraries, PE Studio has you covered:

This DLL never touches disk, so the importance of these are not critically important... however, it is still something that an operator should consider doing. Within the DLL, a simple macro, class, or function to quickly handle dynamic resolution would be acceptable without having to take extra obfuscating steps.

Strings

Strings are a classic mainstay of detection, and a number of popular YARA rules rely on them to quickly detect and attribute samples. As we will see in a later section, strings are used for lazy detections, and rightly so - they are quick, reliable, and work as a common denominator style check. Where tools are downloaded directly from GitHub or samples are written from blogs, gists, or StackOverflow responses, the defenders may as well write logic to detect them using stagnant strings.

[assembly: Guid("658c8b7f-3664-4a95-9572-a3e5871dfc06")]

Why not use this as a detection? This is not to say that ALL detections should be done this way, but the low-hanging fruit should be considered. Even then, a library of these detections can be further split in to differing levels of severity and confidence.

imp_Badger
BadgerDispatch
BadgerDispatchW
BadgerStrlen
BadgerWcslen
BadgerMemcpy
BadgerMemset
BadgerStrcmp
BadgerWcscmp
BadgerAtoi

These will now be used in detections by EDR... To reiterate, strings SHOULD NOT be the only detection logic for a sample, or attribution. But there's little reason why they should be used in at least one rule. With how specific some functions and strings are within malicious software, there is more to be gained from additional detections than would be lost from false positives.

EXE

With the EXE, it's position-independent so it won't have any imports but it will still have strings. Recall on how we obtain the function address at runtime:

typedef HMODULE(WINAPI* LOADLIBRARYA)(LPCSTR lpLibFileName);
CHAR cLoadLibraryA[13] = { 'L', 'o', 'a', 'd','L','i','b','r','a','r','y','A',0 };
Api->LoadLibraryA = GetSymbolAddress(hKernel32, cLoadLibraryA);

We pass the string in as an array to ensure that the LoadLibraryA strings makes their way into the .text section. This means that we should still see a few strings within our safe PE. Within our unsafe PE (where we disable protections using pre-processor definitions, so it is otherwise identical) we can see a bunch of our strings:

Alternatively, encrypt them. Masking strings comes down to creativity. A common tactic with PowerShell malware is using ComSpec to build the IEX string:

$env:ComSpec[4,15,25] -join ""

This will produce:

Iex

Final note: do not confuse the obfuscation of strings with the implants overarching data protection mechanism. If the solution is to XOR the strings, don't also use XOR to protect data over-the-wire...

Reflective DLL

Using PEStudio again on, but this time on the DLL:

As you can see, there are some glaringly suspect strings here.

In next week's runtime blog we will see a lot of these strings again within in the memory regions.

As we can see, the IP of 10.10.11.205, the initialization string, headers, and so on are visible.

When looking into publicly available C2s, and even proprietary ones, the full configuration was able to be extracted and clear-text information found. This is because the algorithm used was something commonly recognisable such as XOR, RC4, and other variations. To make it difficult for the researchers to reverse, Vulpes embeds the absolute minimum required for initial request. On the implant being loaded into memory, it identifies various pieces of information for Environmental Keying. These keys are used to build out the cryptography, protecting the configuration as much as possible.

CAPA

For more information on this utility, the following three blogs provide a great primer on using CAPA:

EXE (Unsafe)

First off, lets check out the "unsafe" executable. Remember, this variant has less logic because its not doing any sort of pre-execution checks. Amongst general PE information, this is the main match:

WCHAR wVerb[4] = { 'G','E','T',0 };
WCHAR wEndpoint[9] = { '/','a','?', 's', 't', 'a', 'g', 'e', 0 };
WCHAR wUserAgent[10] = { 'M','a', 'e', 'l', 's', 't', 'r', 'o', 'm', 0 };
WCHAR wVersion[5] = { 'H','T','T', 'P',0 };
WCHAR wServer[13] = { '1', '0', '.', '1', '0', '.', '1', '1', '.', '2', '0', '5',0 };
WCHAR wReferer[19] = { 'h', 't', 't', 'p', 's', ':', '/', '/', 'g', 'o', 'o', 'g', 'l', 'e', '.', 'c', 'o', 'm',0 };
WCHAR wHeaders[22] = { 'X','-','M','a','e','l','s','t','r','o','m',':',' ','p','a','s','s','w','o','r','d',0 };

This is something we expected. Although, the reason its matched it isn't what we was using it for, but either way; it found something.

EXE (Safe)

The "safe" variant has some extras steps. CAPA is able to spot this.

The code matching:

BOOL IsBeingDebugged()
{
    // Get the PPEB Struct
    PPEB pPeb = (PPEB)__readgsqword(0x60);

    // check if being debugged
    if (pPeb->BeingDebugged == 1)
    {
        return TRUE;
    }
    else
    {
        return FALSE;
    }
}

Reflective DLL

The DLL actually has a lot more data available:

With all this information about the loaders and DLL, it provides a good basis for the things that can/should be replaced. For example, 'allocate RWX memory'. That's a pretty big indicator-of-compromise which we will visit in the next blog.

Virus Total

This isn't something we would recommend you do with your implant, since it will immediately burn it, but this is a throw-away project and we feel its important to see how little indicators some things can have.

Reflective DLL

As of initial upload back in January 2022 our DLL matched some crowdsourced Yara Rules, but only had 2 vendor detections, Kaspersky and Microsoft:

Vendor

Gene

Kaspersky

HEUR:HackTool.Win32.Inject.heur

Microsoft

Trojan:Win32/Sabsik.TE.B!ml

Let's walk through these and see where they were triggered.

Yara Rules

The reflective loader Yara rule consists of the following:

rule ReflectiveLoader {
   meta:
      description = "Detects a unspecified hack tool, crack or malware using a reflective loader - no hard match - further investigation recommended"
      reference = "Internal Research"
      score = 70
      date = "2017-07-17"
      modified = "2021-03-15"
      author = "Florian Roth"
      nodeepdive = 1
   strings:
      $x1 = "ReflectiveLoader" fullword ascii
      $x2 = "ReflectivLoader.dll" fullword ascii
      $x3 = "?ReflectiveLoader@@" ascii
      $x4 = "reflective_dll.x64.dll" fullword ascii
      $x5 = "reflective_dll.dll" fullword ascii

      $fp1 = "Sentinel Labs, Inc." wide
      $fp2 = "Panda Security, S.L." wide ascii
   condition:
      uint16(0) == 0x5a4d and (
            1 of ($x*) or
            pe.exports("ReflectiveLoader") or
            pe.exports("_ReflectiveLoader@4") or
            pe.exports("?ReflectiveLoader@@YGKPAX@Z")
         )
      and not 1 of ($fp*)
}

And similarly, INDICATOR_SUSPICIOUS_ReflectiveLoader comprises:

rule INDICATOR_SUSPICIOUS_ReflectiveLoader {
    meta:
        description = "detects Reflective DLL injection artifacts"
        author = "ditekSHen"
    strings:
        $s1 = "_ReflectiveLoader@" ascii wide
        $s2 = "ReflectiveLoader@" ascii wide
    condition:
        uint16(0) == 0x5a4d and (1 of them or (
            pe.exports("ReflectiveLoader@4") or
            pe.exports("_ReflectiveLoader@4") or
            pe.exports("ReflectiveLoader")
            )
        )
}

We can see that both of these are matching on the following DLL Export:

DLLEXPORT ULONG_PTR WINAPI ReflectiveLoader(LPVOID lpParameter)

Naturally, this can be updated to either this:

DLLEXPORT ULONG_PTR WINAPI SomethingCompletelyDifferent(LPVOID lpParameter)

Or simply:

DLLEXPORT ULONG_PTR WINAPI StartEx(LPVOID lpParameter)

And we can evade the Yara detection.

Kaspersky's HackTool.Win32.Inject.heur Signature

Malicious programs of this family inject their code into the address space of programs running on the infected computer, such as system processes or programs that have access to the Internet.

So this has vaguely something to do with injection. As the DLL doesn't have any calls to VirtualAllocEx, VirtualProtectEx, etc then it's likely also be flagging on the same ReflectiveLoader Export as the Yara rules - although it could also be the main thread running to run Maelstrom():

hThread = CreateThread(NULL, NULL, ThreadFunction, NULL, 0, NULL);

Although that's probably unlikely.

Microsoft's Win32/Sabsik.TE.B!ml Signature

Cloud Submissions Enabled

Finally, to illustrate why, as an operator, it might be a good idea to turn off cloud submissions on your development machine, we re-ran the VirusTotal scans a few months after initially writing our implants.

Rerunning the scan on the 15th July 2022, it's now increased to 25 vendor detections, from January's 2:

Nearly a month later, on the 8th August 2022, and our DLL is picked up by 27 vendors:

EXE (unsafe)

EXE (safe)

As we discussed in the introduction, fresh code that genuinely doesn't contain previously used elements or suspicious strings will generally bypass most antivirus and EDR solutions on the market, even without a huge amount of work on the part of the operator. However, as we will see in the next post, there are numerous ways to get noticed at runtime, and thats without talking about operator behaviour (which we won't cover in this series).

Intezer

Below is a demonstration of using Intezer within a pipeline:

In the above example, the information is a bit scarce. However, Generic Malware was hit which could be a indicator-of-compromise.

In addition, Intezer tries to identify capabilities, TTPS and general indicator-of-compromise. Obviously, don't upload your entire implant, but it could be useful to assess certain aspects of the implant to see what behaviour is considered suspicious.

Entropy

Entropy is a concept from physics - essentially the measure of the randomness. When it comes to computing, most programs have a "normal" level of randomness when looking at their raw bytes - words, phrases, and code generally is more predictable than a random string of bytes. A C2's implant, especially one which heavily relies on encrypting strings will have an abnormal level of entropy that will stand out as these encrypted regions will appear less predictable.

As entropy is reasonably quick to calculate, it can act as a quick and simple measure of the likelihood of a program being malicious.

As the above graph shows, entropy levels above 6.5 are increasingly suspicious, and entropy values above 7 can be reasonably assumed to be malicious and require further inspection. There is an intriguing spike at the lower end of the scale - after all, this is merely a rough indicator, especially when compared to other measures. At a guess, this might be due to smaller PowerShell or other reverse shell scripts with minimal encryption, or where encrypted strings such as the IV for channel encryption comprise a small part of an otherwise far larger implant.

for i in $(find -regex '.*\.\(exe\|dll\)'); do echo $i && ent $i|grep Entropy && echo -n '\n'; done

Thankfully, as all of our libraries are pretty minimal anyway and don't heavily use encryption beyond what's needed, we've thankfully been able to dodge artificially reducing our entropy to a reasonable level:

./agent/stage0/bin/maelstrom.unsafe.x64.exe
Entropy = 5.254732 bits per byte.

./agent/stage0/bin/maelstrom.safe.x64.exe
Entropy = 5.270877 bits per byte.

./agent/stage1/bin/maelstrom.x64.dll
Entropy = 5.787415 bits per byte.

Authenticode

At a basic level however, Authenticode allows for code to be signed using a digital certificate. This can be included natively within Visual Studio or added in as a compilation step. When running code, Windows will check the certificate to ensure that the code is signed by a valid certificate authority prior to running.

In the absence of detailed analysis on our part, we'd recommend reading the following posts:

Conclusion

While there are plenty more checks that can be run against our implant to test their response to detections, we've only showed a few of them in this post.

Some exercises for the reader include removing the DOS Message and NT Headers, as well as evaluating the implant performance against other crowdsourced Yara rules. Naturally when developing an actual implant, as we saw in our review of VirusTotal over time, using VirusTotal to test your implant is a good way to cap its usable lifetime.

In our introduction we discussed how some implants seem to just run, and some environments seem to just permit implants without flagging them. Using the techniques we have examined today, the answer should hopefully be simple - they just didn't happen to trigger enough suspicion. Lots of code within an implant isn't malicious; and a freshly written implant that doesn't reference or include prior suspicious works won't necessarily be suspicious. In the same vein, there are only so many ways to call WinAPI or carry out suspicious behaviours such as a reflective loader, and even one which has just been authored can contain calls and functions that have long been flagged as suspicious.

Unfortunately, the answer is to just keep testing and re-testing the implant against EDRs and Yara rules, and continue to develop further novel and or overlooked techniques to maintain the edge. For the defender, the more rules and catalogues that can be included within the environment, the high the chance of flagging an implant early.

In our next post, we will look at the implant at runtime by examining its behaviour, its memory, and its general OpSec. We will again look at how it can be detected and, in turn, at how an operator might begin to consider bypassing these detections.

PreviousHome NextMaelstrom #6: Working with AMSI and ETW for Red and Blue

Last updated 9 months ago