Commit Graph

316 Commits

Author SHA1 Message Date
f7d70eed6c fixed how we get the diffs before (it was wrong),
extracted the way to get the last commit before the comments
2025-04-01 09:52:57 +02:00
af4fbaa7f3 added the type of the error in the print, because
some errors are not very verbose in what's going wrong
2025-04-01 09:48:24 +02:00
c31686ad63 not compiling, testing, etc. for files that are
not code related
2025-04-01 09:20:37 +02:00
0b238db879 fixed the name of the archive 2025-04-01 09:20:26 +02:00
28eebf158a some users have been deleted since, so the user
attribute of the comment is None
2025-04-01 09:19:52 +02:00
306e80648b added print statements 2025-04-01 09:19:35 +02:00
e697890395 now archiving both before (context for AI) and after (tests for benchmark) 2025-03-31 21:50:11 +02:00
d8ab48dc82 excluding repos that have no comments 2025-03-31 21:25:08 +02:00
352758a600 not failing on unexpected error, but writing them
above the progress bar
2025-03-31 21:24:21 +02:00
c20f9d6a6c made checkout function more modular 2025-03-31 21:20:56 +02:00
f79d3d7807 made a better job to check if the second comment
is an answer to the first one
2025-03-31 21:20:17 +02:00
1a53f28ae0 now archiving the repo at the given pr 2025-03-31 15:55:23 +02:00
306aa4a985 removed the only_inject_jacoco 2025-03-31 15:49:25 +02:00
480dacea3e now using normal names 2025-03-31 15:32:18 +02:00
b482c35b90 cleaned up dataset 2025-03-31 15:31:36 +02:00
6bd30ef545 removed unused imports 2025-03-31 15:31:04 +02:00
f785364fb8 made a unique bar for the processing of the pr 2025-03-31 15:30:14 +02:00
abc642d969 fixed slight issue with naming of variables 2025-03-31 15:29:43 +02:00
941e0cb19f fixed the way we get the diffs after 2025-03-31 15:29:18 +02:00
61ed6aa1b9 fixed mistake 2025-03-31 15:29:02 +02:00
669049b7a4 now using only the new datset version 2025-03-31 14:25:17 +02:00
35bd296c7c made clone use raising expections instead of updates 2025-03-31 13:21:04 +02:00
46d8d45d7c Formatted utils.py 2025-03-31 11:49:36 +02:00
308f58b587 fixed final edgecase 2025-03-30 10:58:48 +02:00
d24c9d8461 removed progress bar that was instant 2025-03-29 10:25:31 +01:00
7e64ab6574 moved github logging to file 2025-03-29 09:44:00 +01:00
dd5a67561b commented out annoying code 2025-03-29 09:43:51 +01:00
e081560879 imported function from utils 2025-03-29 09:43:39 +01:00
aaafe21a3c added progress bar for each entry migration 2025-03-29 09:43:18 +01:00
d7cba34e3d made so that binary files content are ignored 2025-03-28 18:18:36 +01:00
69bf557a61 made migration better 2025-03-28 15:04:01 +01:00
838837a788 extracted function to utils 2025-03-28 15:03:45 +01:00
649043d9f0 first draft of migration to augment the data 2025-03-28 11:15:21 +01:00
0ed221acb8 added jacoco injections 2025-03-27 14:05:17 +01:00
e8cf0b4e37 formatted handlers.py 2025-03-27 09:42:37 +01:00
2e04ed49a3 fixed slight mistake 2025-03-26 14:50:36 +01:00
19729fae7c wrote better code 2025-03-26 14:09:21 +01:00
be34e9d2f6 put check for bot as first and apparently that
sped up the process quite a bit, nice
2025-03-26 14:04:14 +01:00
99d8abb7e7 moved some lines around to defer responsability 2025-03-26 13:43:33 +01:00
b02f08452d instead of making two passes through the PRs, we
just do one, and process it if it's good
2025-03-26 13:42:41 +01:00
9a48413ead added line for completness even tho it will never
be executed
2025-03-26 13:40:31 +01:00
0d8b81054d formatted code 2025-03-26 13:05:05 +01:00
fa3b7f82a1 comverted code to have instead of one comment,
have a list of them
2025-03-26 12:41:46 +01:00
4c6522ae63 removed useless code 2025-03-26 12:41:01 +01:00
77c5e2bec0 removed code pertaining to stats, now only relying
on cache
2025-03-26 09:05:49 +01:00
4c56a352e7 added default parameter to keep or not the "Was
still being processed" item from the backup (to delegate to the caller if he wants it)
2025-03-26 09:02:32 +01:00
d95e4ebdf8 fixed slight mistake 2025-03-23 13:55:24 +01:00
e685e62341 improved the lazy loading from cache even more 2025-03-23 13:26:23 +01:00
ad96cac750 covered newly discovered edge case 2025-03-23 11:15:44 +01:00
1aa42c32cd improved caching 2025-03-23 10:54:43 +01:00