Fuzzing data optimisation #717
denis-fokin
started this conversation in
Ideas
Replies: 2 comments 1 reply
-
ML or not ML is a good questions, but collect and analyze typical mocks/junit tests as a dataset will be useful |
Beta Was this translation helpful? Give feedback.
1 reply
-
If it's possible to detect the generic type of a Collection, it would be good to add a test with some meaningful (prepared?) values in it - starting from String, Integer. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
There are some hints that can help us with finding better values for fuzzing.
Regular expressions
Multiple solutions exist to generate strings from regular expressions.
For instance,
If we find a regular expression check somewhere on the execution path, we can generate a number of random values that matches the regular expression. The results could be used for fuzzing.
DSL
Similar approach can be applied to different DSLs. For instance, we can find some piece of HTML. We can modify the HTML string and try to change the HTML content without breaking its validity.
API
ML could give us a helping hand. We can ask for a common data that typically used for particular API. This way will get proper pieces of text for API that expects JSON, HTML or other input. The input can be modified randomly for better fuzzing results.
Alternatively, can build a dataset for most common APIs like ktor, Jackson and so on.
Beta Was this translation helpful? Give feedback.
All reactions