Specific Syscalls #152

ryantxu1 · 2023-04-10T18:24:28Z

Adding specific linux and windows syscalls to the taxonomy.

Before merging @hack-sentinel I have a few questions:

I don’t know how to get hierarchy tree updated
Do these entries of specific calls need neighbors?
I have a case where the Linux Pause Syscall can suspend a thread and a process. How can I duplicate a "Linux Pause" class in protégé? If I can't do that, how should the protégé iri be handled?

hack-sentinel · 2023-04-11T23:47:36Z

Adding specific linux and windows syscalls to the taxonomy.

Before merging @hack-sentinel I have a few questions:
* I don’t know how to get hierarchy tree updated

If I understand the problem, you are making edits but not seeing those in the tree viewer for the classes. Sometimes the Protege app will not display what it should even though you've made a change. Try closing the tree and re-opening. You may need to save and reload to get it to render the tree as expected.

* Do these entries of specific calls need neighbors?

[It's possible I haven't interpreted this question right, but here goes...]

By 'neighbors' I will assume you mean specifically sibling entries for specific types of calls for say, different OSes (i.e., children of a common parent class.) As an example of a singleton (no siblings aka neighbors), I see a singleton child 'Windows NtDuplicateToken' as subclass (child) of 'Copy Token'. Generally, we'd hope to find some additional children, just like one would in document outlines so we would have something additional to contrast or even to create a partition across the children classes. But sometimes there is a specialization of a parent concept that needs to be expressed and is just that, a singleton class. Don't worry too much if you haven't found an equivalent Linux specialization of the notion of 'Copy Token' just yet. If you are highly confident there is no equivalent anywhere, then moving Windows NtDuplicateToken up a level might make sense the general case of ontology class design [1].

[1] I'm not sure of all the use cases or prior discussion here, so even if you are sure there is no Linux analog to Windows for Copy Token, delay moving it up until we have a quick chat with the rest of the syscall taxonomy dev & review team. Having an 'unnecessary' abstraction won't hurt any of the logic and might make it easier to navigate (having a first layer of only syscall abstractions at the top level of System Call hierarchy could come in handy.)

* I have a case where the Linux Pause Syscall can suspend a thread and a process. How can I duplicate a "Linux Pause" class in protégé? If I can't do that, how should the protégé iri be handled?

Well you can reuse a class in the sense it can show up in different contexts, structures, or occasionally even have multiple parents. If there are different concepts (say one for pausing a thread and one for pausing a process) then they should be different classes with different IRIs and different labels. I'd recommend having two different classes here: one under Suspend Process (rename existing IRI and label from LinuxPause and Linux Pause to LinuxPauseProcess and 'Linux Pause Process) respectively and add one under Suspend Thread (Linux Pause Thread). Both of these Linux children classes can reference seeAlso -> ..../pause(2). Then copy over and distinguish the definition of Linux Pause Thread, borrowing from Linux Pause Process.

ioggstream · 2023-04-12T19:42:51Z

Shouldn't we have a POSIX taxonomy first?

netfl0 · 2023-04-27T15:57:00Z

Shouldn't we have a POSIX taxonomy first?

This is a great idea, since we've already done the windows analysis perhaps second :)

@ryantxu1 let me know your thoughts here.

Cc @hack-sentinel

ryantxu1 · 2023-05-01T15:46:51Z

@netfl0 Open to exploring this eventually, are we envisioning a similar process of bucketing the system interfaces in POSIX similar to the linux/windows syscalls?

src/ontology/d3fend-protege.ttl

ioggstream · 2023-05-03T16:02:01Z

Further considerations you may have already taken into account and that - imho - could shift the bar towards POSIX APIs. OTOH POSIX APIs can be either system calls (eg. execve ) or library functions (malloc).

Linux has ~400 syscalls with different security profiles. Moreover some syscalls (e.g. fork) are implemented on top of others. Some security-related functions (e.g. malloc/free) are not syscalls (but I don't think people will reference man 2 brk when tagging objects with d3fend).

man 2 syscalls | grep '[a-z]\(2\)     ' -c

Forking can done in different ways, e.g.,

fork / vfork
clone / clone2 / clone3

POSIX has less APIs but maybe more famous, as they include fork, open, malloc, free, printf. Taking for granted the perl POSIX interface man page, POSIX has ~300 calls, some of which just library calls.

man posix| grep '    "[a-z0-9]+"'

It is probably easier to use that kind of reference. Moreover it can apply to other OSs

Questions:

which is the mapping strategy you used for windows syscalls?

ioggstream · 2023-05-03T16:08:16Z

src/ontology/d3fend-protege.ttl

@@ -16216,8 +16593,7 @@ order to constitute a complete standard. For a complete definition of all requir
    rdfs:label "Linux ELF File 64bit" .

 :LinuxExec a :CreateProcess,


LinuxExec does not create a process. Instead, it replaces the program that is currently being run by the calling process with a new program, with newly initialized stack, heap, and (initialized and
uninitialized) data segments.

There are other parts of the process that are not replaced, eg.,

The process's real UID and real GID, as well its supplementary group IDs, are unchanged;
file descriptors remain open unless marked close-on-exec.

Syscalls are a slippery slope...

ioggstream · 2023-05-03T16:17:39Z

src/ontology/d3fend-protege.ttl

@@ -6264,6 +6296,216 @@ Most current Unix-like systems and Microsoft Windows support loadable kernel mod
    rdfs:subClassOf :DigitalArtifact ;
    rdfs:seeAlso "https://dbpedia.org/resource/Link" .

+:Linux_Exit a owl:Class ;
+    rdfs:label "Linux _Exit" ;


Can't find _exit in man 2 syscalls | grep _exit
instead, there's an _exit in POSIX.

Here and elsewhere: why don't use the verbatim syscall name, e.g. "exit(2)" in the label?

@ioggstream I put '_exit' because the man page of that specifically states that as the name when actually calling the function.
https://man7.org/linux/man-pages/man2/exit.2.html

That is also why I left out the '(2)' in the labels as I was focusing on the function names themselves. However, I'm happy to reconsider the naming convention if there's an advantage to do so!

Using system call names or function signatures is a "design" choice. Since implementations can vary in time (see also the link you posted https://man7.org/linux/man-pages/man2/exit.2.html) if we want to focus on Linux, I'd use a man-like uri (e.g. man-pages/man2/exit.2.html references both exit and 2).

This allows us to define a POSIX version (e.g., POSIX_exit)

Moreover, I'd avoid camelizing the syscall name (e.g., Linux-exit or Linux-_exit, or something like that)

ioggstream · 2023-05-03T16:24:27Z

src/ontology/d3fend-protege.ttl

+:LinuxExecve a owl:Class ;
+    rdfs:label "Linux Execve" ;
+    rdfs:subClassOf :CreateProcess ;
+    :definition "Execute program." ;
+    rdfs:seeAlso "https://man7.org/linux/man-pages/man2/execve.2.html" .
+
+:LinuxExecveat a owl:Class ;
+    rdfs:label "Linux Execveat" ;
+    rdfs:subClassOf :CreateProcess ;
+    :definition "Execute program relative to a directory file descriptor." ;
+    rdfs:seeAlso "https://man7.org/linux/man-pages/man2/execveat.2.html" .


Are Execute and Create in the same class?

They both Create Process (this they are both a sublcass of that semantic class) if that makes sense. We've got some clean up coming, @ryantxu1 will be pushing to this branch soon.

Understood. Since I don't think this reflects the Linux taxonomy I will wait for the cleanup.

src/ontology/d3fend-protege.ttl

netfl0 · 2023-06-16T17:39:01Z

let me know when ready for review

ryantxu1 · 2023-06-21T17:25:34Z

@netfl0 Ready for review

ioggstream · 2023-06-24T15:33:03Z

src/ontology/d3fend-protege.ttl

+        [ a owl:Restriction ;
+            owl:onProperty :executes ;
+            owl:someValuesFrom :Process ] ;
+    :definition "Executes a process." ;


IIUC with system calls, you execute a program. I see that in the Windows world there's spawnwe() that makes fork/exec, but the name ExecuteProcess is confusing to me.

ioggstream · 2023-06-24T15:39:24Z

src/ontology/d3fend-protege.ttl

@@ -4525,7 +4535,7 @@ SafeSEH might be applied only to some executable files or modules, allowing an a
            owl:someValuesFrom :ExecutableFile ],
        [ a owl:Restriction ;
            owl:onProperty :restricts ;
-            owl:someValuesFrom :CreateProcess ] ;
+            owl:someValuesFrom :SpawnProcess ] ;


Formally, when the execve(2) is invoked, the process was already spawn.

Moreover, I can create a new process just forking an existing one without execve(2) - e.g., running another instance of the same program.

ioggstream · 2023-06-25T14:19:40Z

src/ontology/d3fend-protege.ttl

+    rdfs:label "Spawn Process" ;
+    skos:altLabel "Process Spawn" ;
+    rdfs:subClassOf :SystemCall ;
+    :definition "A process spawn refers to a function that loads and executes a new child process.The current process may wait for the child to terminate or may continue to execute asynchronously. Creating a new subprocess requires enough memory in which both the child process and the current program can execute. There is a family of spawn functions in DOS, inherited by Microsoft Windows. There is also a different family of spawn functions in an optional extension of the POSIX standards.  Fork-exec is another technique combining two Unix system calls, which can effect a process spawn." ;


Suggested change

:definition "A process spawn refers to a function that loads and executes a new child process.The current process may wait for the child to terminate or may continue to execute asynchronously. Creating a new subprocess requires enough memory in which both the child process and the current program can execute. There is a family of spawn functions in DOS, inherited by Microsoft Windows. There is also a different family of spawn functions in an optional extension of the POSIX standards. Fork-exec is another technique combining two Unix system calls, which can effect a process spawn." ;

:definition "A process spawn refers to a function that loads an executable and executes it in a new child process. The current process may wait for the child to terminate or may continue to execute asynchronously. Creating a subprocess requires enough memory in which both the child process and the current program can execute. There is a family of spawn functions in DOS, inherited by Microsoft Windows. There is also a different family of spawn functions in an optional extension of the POSIX standards. Fork-exec is another technique combining two Unix system calls, which can effect a process spawn." ;

ioggstream · 2023-06-25T14:21:48Z

src/ontology/d3fend-protege.ttl

+            owl:onProperty :suspends ;
+            owl:someValuesFrom :Thread ] ;
+    :definition "Suspending a thread causes the thread to stop executing user-mode code." ;
+    rdfs:seeAlso "https://learn.microsoft.com/en-us/windows/win32/api/processthreadsapi/nf-processthreadsapi-suspendthread" .


See also https://man7.org/linux/man-pages/man2/signal.2.html

IIRC on Linux it's the Kernel scheduler pausing the thread (e.g., https://docs.kernel.org/scheduler/sched-design-CFS.html ) but I can be a bit rusty on this topic.

ryantxu1 added 2 commits April 3, 2023 11:05

prototype process+thread syscall additions

e0824ab

syscall additions

3c1d893

Clarifying suspend process/thread

95df9d2

netfl0 reviewed May 3, 2023

View reviewed changes

src/ontology/d3fend-protege.ttl Outdated Show resolved Hide resolved

netfl0 reviewed May 3, 2023

View reviewed changes

src/ontology/d3fend-protege.ttl Outdated Show resolved Hide resolved

ioggstream reviewed May 3, 2023

View reviewed changes

netfl0 reviewed Jun 15, 2023

View reviewed changes

src/ontology/d3fend-protege.ttl Outdated Show resolved Hide resolved

ryantxu1 added 2 commits June 20, 2023 11:54

Touch ups, redoing processes

d30f40b

Replacing NT source

ab11ca9

ioggstream reviewed Jun 24, 2023

View reviewed changes

ioggstream reviewed Jun 25, 2023

View reviewed changes

bump version

7a831b9

netfl0 force-pushed the syscall branch from 2081370 to ab11ca9 Compare July 8, 2023 01:08

merge develop

e053995

netfl0 merged commit e685e91 into d3fend:develop Jul 8, 2023
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Specific Syscalls #152

Specific Syscalls #152

ryantxu1 commented Apr 10, 2023

hack-sentinel commented Apr 11, 2023 •

edited

Loading

ioggstream commented Apr 12, 2023

netfl0 commented Apr 27, 2023 •

edited

Loading

ryantxu1 commented May 1, 2023

ioggstream commented May 3, 2023

ioggstream May 3, 2023 •

edited

Loading

ioggstream May 3, 2023 •

edited

Loading

ryantxu1 Jun 19, 2023 •

edited

Loading

ioggstream Jun 25, 2023 •

edited

Loading

ioggstream May 3, 2023

netfl0 May 8, 2023

ioggstream May 9, 2023

netfl0 commented Jun 16, 2023

ryantxu1 commented Jun 21, 2023

ioggstream Jun 24, 2023 •

edited

Loading

ioggstream Jun 24, 2023

ioggstream Jun 25, 2023

ioggstream Jun 25, 2023 •

edited

Loading

		@@ -16216,8 +16593,7 @@ order to constitute a complete standard. For a complete definition of all requir
		rdfs:label "Linux ELF File 64bit" .

		:LinuxExec a :CreateProcess,

Specific Syscalls #152

Specific Syscalls #152

Conversation

ryantxu1 commented Apr 10, 2023

hack-sentinel commented Apr 11, 2023 • edited Loading

ioggstream commented Apr 12, 2023

netfl0 commented Apr 27, 2023 • edited Loading

ryantxu1 commented May 1, 2023

ioggstream commented May 3, 2023

ioggstream May 3, 2023 • edited Loading

Choose a reason for hiding this comment

ioggstream May 3, 2023 • edited Loading

Choose a reason for hiding this comment

ryantxu1 Jun 19, 2023 • edited Loading

Choose a reason for hiding this comment

ioggstream Jun 25, 2023 • edited Loading

Choose a reason for hiding this comment

ioggstream May 3, 2023

Choose a reason for hiding this comment

netfl0 May 8, 2023

Choose a reason for hiding this comment

ioggstream May 9, 2023

Choose a reason for hiding this comment

netfl0 commented Jun 16, 2023

ryantxu1 commented Jun 21, 2023

ioggstream Jun 24, 2023 • edited Loading

Choose a reason for hiding this comment

ioggstream Jun 24, 2023

Choose a reason for hiding this comment

ioggstream Jun 25, 2023

Choose a reason for hiding this comment

ioggstream Jun 25, 2023 • edited Loading

Choose a reason for hiding this comment

hack-sentinel commented Apr 11, 2023 •

edited

Loading

netfl0 commented Apr 27, 2023 •

edited

Loading

ioggstream May 3, 2023 •

edited

Loading

ioggstream May 3, 2023 •

edited

Loading

ryantxu1 Jun 19, 2023 •

edited

Loading

ioggstream Jun 25, 2023 •

edited

Loading

ioggstream Jun 24, 2023 •

edited

Loading

ioggstream Jun 25, 2023 •

edited

Loading