-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Design for d6tflow framework #123
Comments
@lyriccoder can you please write this issue in terms of Problem and Proposed solution Also, give the issue a more informative name |
Some comments: |
E.g., if we filter by ncss, we add to the all data ncss value. If we filter invocations by the |
(1) OK. do we need to pass the AST to the next step then? (2) I meant, just have a more abstract representation processing steps. Merging them is a way of abstraction (3) Filter is not data, it s an operator. Or do you mean something else by "filter"? The edges have to be labeled with data only. Nodes are operators. You can maybe write "filters(data)" |
|
We can split our tasks to the following Task of
d6tflow
frameworkTask1 -> open Java file with correct encoding
Task2 -> remove all spaces and comments in it and save to another file
Task3 -> open file, find all method which can be inlined. Save target, extracted, full_ast, text_file, filename, row_csv from Task2
Task4 -> Task3 get target, extracted and filter it. Save target, extracted, full_ast, text_file, filename, row_csv from Task3
Task5 -> get result from Task3 and filter limited cases. Save target, extracted, full_ast, text_file, filename, row_csv from Task4
Task6 -> Inline Method, save file, row_csv
Task 7 -> save row_csv to global DataFrame
Possible problems:
preprocessed
files to external memory, since we will have lots of files and it won't have enough memory to keep them in cache. Also, we have to keep them also in external memory since, it's our dataset which will be validated.Seems, it cannot be done due to Reconfigure save() option to point to external store d6t/d6tflow#6
Support for Tasks that outputs different types and other extensions... d6t/d6tflow#26
The text was updated successfully, but these errors were encountered: