Fix oom #2665

shargon · 2022-02-26T12:08:00Z

fyrchik · 2022-02-26T12:59:04Z

This solution restricts intermediate array size, even though the final output can be empty (e.g. the last level index can be out of range). I have implemented another solution (process value in a DFS fashion and restrict the size the real result to MaxOracleResponseSize), but this one is simpler to implement and covers all reasonable cases. Good job!

fyrchik · 2022-02-26T13:09:07Z

tests/neo.UnitTests/IO/Json/UT_JPath.cs

@@ -54,6 +55,13 @@ public class UT_JPath
            ["data"] = null,
        };

+        [TestMethod]


Could you also add a boundary test where the size of an intermediate array is approximately equal to the maximum allowed limit?

The size in case is width^depth (64^6 in this case). So tests for 2^10 (width 10, depth 2) and 32^2 (width 2, depth 32) should pass and result in an array of 1024 empty objects. Note, that the value itself should also be changed in this case (amount of [ should be equal to depth).

This will help to ensure neo-go stays compatible.

My bad, the second test shouldn't pass because of the depth limit.

Jim8y · 2022-02-26T16:41:13Z

src/neo/IO/Json/JPathToken.cs

        {
            List<JObject> results = new();
            JPathToken token = DequeueToken(tokens);
            if (token.Type != JPathTokenType.Identifier) throw new FormatException();
            while (objects.Length > 0)
            {
                results.AddRange(objects.Where(p => p is not null).SelectMany(p => p.Properties).Where(p => p.Key == token.Content).Select(p => p.Value));
-                Descent(ref objects, ref maxDepth);
+                Descent(ref objects, ref maxDepth, maxObjects);
+                if (results.Count > maxObjects) throw new InvalidOperationException(nameof(maxObjects));


I would suggest to make the exception more precise, use ArgumentOutOfRangeException or OutOfMemoryException instead.

shargon · 2022-02-28T08:26:22Z

I have implemented another solution (process value in a DFS fashion and restrict the size the real result to MaxOracleResponseSize)

Could you share your solution?

fyrchik · 2022-02-28T12:45:19Z

Could you share your solution?

Here it is nspcc-dev/neo-go@e4ec405 .

The idea is that we first parse the filter itself into some struct, which allows us to perform DFS on the value we apply filter on.
Good things about it:

JSON can be formed simultaneously with traversal, avoiding need to construct objects in memory (haven't finished this yet, because of purely technical difficulties).
It can return the correct answer in more cases.

However, there is a single bad thing which makes me like your solution more: my implementation leaves room for filters which take long time to process. And this can't be reflected in GAS fee. I can't come up with an example immediately, but it could exist in theory.

Jim8y · 2022-02-28T13:56:33Z

@fyrchik love your solution, may you please share with us the time it costs to run this oom case when you fully implement it in dfs?

Jim8y · 2022-03-01T04:37:24Z

@shargon hey shargon, do you have any plan to further optimise the jsonpath filter? JsonPath should not process such an empty or simple json at all in this oom case, maybe we can have a preprocess or something to verify both the json and filter before executing it.

fyrchik · 2022-03-01T05:54:41Z

@Liaojinghui the remaining part won't change anything significantly, sadly I also don't have time to work on finishing this. However, some results can already be seen. I took the example from this issue, replaced the last [0,0... with [1,1... and benchmarked againt different depth parameters.

depth  time
4      22.766441ms
5      1.142487091s
6      1m18.027576585s

So, 1 minute execution time is certainly not what we want.

shargon · 2022-03-03T09:50:15Z

So, 1 minute execution time is certainly not what we want.

We can reduce the max filter length, and the max depth to 4.

fyrchik · 2022-03-04T12:12:59Z

@shargon but should we? In your solution it is easy to see why the time is bounded: we move through the filter in one direction,
maintain the list of current objects and don't allow this list to grow large. IMO limiting max depth can prevent some reasonable applications from working, while limiting intermediate objects amount achieve exactly what we want: do not process large or time-consuming inputs.

Jim8y · 2022-03-04T13:29:55Z

@fyrchik how do we deal with a reasonable application but will cause oom? No matter how reasonable it is, the core problem here is we do not have the ability or enough memory to process it.

And size limit can be seen everywhere in the virtual machine.

fyrchik · 2022-03-09T07:38:30Z

@Liaojinghui I think we agree on the issue. The choice here is between restricting the amount of intermediate objects and further restricting maximum allowed depth. They serve the same purpose, so I see less value in restricting both of these parameters.
But with the former (this PR) we will be able to support a wider class of reasonable applications.

src/neo/IO/Json/JPathToken.cs

* Add ToJson overload (#2671) * Add ToJson overload * change * Update src/neo/VM/Helper.cs * Update src/neo/VM/Helper.cs * Update src/neo/VM/Helper.cs * Update src/neo/VM/Helper.cs Co-authored-by: Jinghui Liao <jinghui@wayne.edu> * Update src/neo/VM/Helper.cs Co-authored-by: Jinghui Liao <jinghui@wayne.edu> Co-authored-by: Shargon <shargon@gmail.com> Co-authored-by: Jinghui Liao <jinghui@wayne.edu> * Fix oom (#2665) * Fix oom * Revert reorder * parameters order Co-authored-by: Erik Zhang <erik@neo.org> * Optimize inventory (#2659) * add `murmur32` to crypto lib (#2604) * 3.2.0 * fix Co-authored-by: Shargon <shargon@gmail.com> Co-authored-by: Jinghui Liao <jinghui@wayne.edu>

Fix oom

72da5d8

shargon requested a review from erikzhang February 26, 2022 12:08

fyrchik reviewed Feb 26, 2022

View reviewed changes

Jim8y approved these changes Feb 26, 2022

View reviewed changes

Jim8y reviewed Feb 26, 2022

View reviewed changes

steven1227 approved these changes Feb 26, 2022

View reviewed changes

erikzhang reviewed Mar 9, 2022

View reviewed changes

src/neo/IO/Json/JPathToken.cs Outdated Show resolved Hide resolved

Revert reorder

65dfa67

Jim8y approved these changes Mar 12, 2022

View reviewed changes

parameters order

79ced35

erikzhang approved these changes Mar 17, 2022

View reviewed changes

shargon merged commit 24389c6 into neo-project:develop Mar 17, 2022

shargon deleted the fix-oom branch March 17, 2022 09:12

superboyiii mentioned this pull request Mar 18, 2022

Neo v3.2.1 Checklist #2676

Closed

18 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix oom #2665

Fix oom #2665

shargon commented Feb 26, 2022

fyrchik commented Feb 26, 2022

fyrchik Feb 26, 2022

fyrchik Feb 26, 2022

Jim8y Feb 26, 2022

shargon commented Feb 28, 2022

fyrchik commented Feb 28, 2022 •

edited

Loading

Jim8y commented Feb 28, 2022

Jim8y commented Mar 1, 2022 •

edited

Loading

fyrchik commented Mar 1, 2022

shargon commented Mar 3, 2022

fyrchik commented Mar 4, 2022

Jim8y commented Mar 4, 2022 •

edited

Loading

fyrchik commented Mar 9, 2022

Fix oom #2665

Fix oom #2665

Conversation

shargon commented Feb 26, 2022

fyrchik commented Feb 26, 2022

fyrchik Feb 26, 2022

Choose a reason for hiding this comment

fyrchik Feb 26, 2022

Choose a reason for hiding this comment

Jim8y Feb 26, 2022

Choose a reason for hiding this comment

shargon commented Feb 28, 2022

fyrchik commented Feb 28, 2022 • edited Loading

Jim8y commented Feb 28, 2022

Jim8y commented Mar 1, 2022 • edited Loading

fyrchik commented Mar 1, 2022

shargon commented Mar 3, 2022

fyrchik commented Mar 4, 2022

Jim8y commented Mar 4, 2022 • edited Loading

fyrchik commented Mar 9, 2022

fyrchik commented Feb 28, 2022 •

edited

Loading

Jim8y commented Mar 1, 2022 •

edited

Loading

Jim8y commented Mar 4, 2022 •

edited

Loading