acce738872
made the requests never expire
2025-05-14 09:39:55 +02:00
0b02518374
found out that some couldn't checkout due to
...
conflicts, but if you force it, it works
2025-05-14 09:36:12 +02:00
ccd962c205
now using the metadata to get archive name
2025-05-14 09:36:06 +02:00
36b7dc5c02
added uuid when creating the dataset
2025-05-07 10:38:20 +02:00
e9816d4492
removed unused imports
2025-04-05 16:01:28 +02:00
bf8869e66c
was accidentally copying over prs that were cached
...
twice
2025-04-01 15:45:23 +02:00
d4dd72469e
instead of creating a list of the comments, using
...
the paginated list and totalCount
2025-04-01 14:46:42 +02:00
e2f313a62a
made better argparse things
2025-04-01 12:15:24 +02:00
12b98bf1ef
removed the throttle of pygithub to make requests
...
faster
2025-04-01 11:45:43 +02:00
6d28d89873
added return guard to remove indent level
2025-04-01 11:01:06 +02:00
bc71a21c30
instead of leaving reason_for_failure empty for
...
valid PRs, I now put that it's valid (even tho it's not a reason for _failure_ techinally, gne gne gne...)
2025-04-01 11:00:11 +02:00
a362aba344
added a simple caching of the requests to make it
...
much quicker to fail and restart
2025-04-01 10:14:45 +02:00
a24ffa00fc
made help message shorter
2025-04-01 10:14:19 +02:00
b9d1923bd8
since the comment file might not be in the PR
...
files (since it was reverted back to its original state, we manually need to check if it's code related)
2025-04-01 09:53:26 +02:00
f7d70eed6c
fixed how we get the diffs before (it was wrong),
...
extracted the way to get the last commit before the comments
2025-04-01 09:52:57 +02:00
af4fbaa7f3
added the type of the error in the print, because
...
some errors are not very verbose in what's going wrong
2025-04-01 09:48:24 +02:00
c31686ad63
not compiling, testing, etc. for files that are
...
not code related
2025-04-01 09:20:37 +02:00
0b238db879
fixed the name of the archive
2025-04-01 09:20:26 +02:00
28eebf158a
some users have been deleted since, so the user
...
attribute of the comment is None
2025-04-01 09:19:52 +02:00
306e80648b
added print statements
2025-04-01 09:19:35 +02:00
e697890395
now archiving both before (context for AI) and after (tests for benchmark)
2025-03-31 21:50:11 +02:00
d8ab48dc82
excluding repos that have no comments
2025-03-31 21:25:08 +02:00
352758a600
not failing on unexpected error, but writing them
...
above the progress bar
2025-03-31 21:24:21 +02:00
c20f9d6a6c
made checkout function more modular
2025-03-31 21:20:56 +02:00
f79d3d7807
made a better job to check if the second comment
...
is an answer to the first one
2025-03-31 21:20:17 +02:00
1a53f28ae0
now archiving the repo at the given pr
2025-03-31 15:55:23 +02:00
306aa4a985
removed the only_inject_jacoco
2025-03-31 15:49:25 +02:00
480dacea3e
now using normal names
2025-03-31 15:32:18 +02:00
6bd30ef545
removed unused imports
2025-03-31 15:31:04 +02:00
f785364fb8
made a unique bar for the processing of the pr
2025-03-31 15:30:14 +02:00
abc642d969
fixed slight issue with naming of variables
2025-03-31 15:29:43 +02:00
941e0cb19f
fixed the way we get the diffs after
2025-03-31 15:29:18 +02:00
61ed6aa1b9
fixed mistake
2025-03-31 15:29:02 +02:00
669049b7a4
now using only the new datset version
2025-03-31 14:25:17 +02:00
838837a788
extracted function to utils
2025-03-28 15:03:45 +01:00
0ed221acb8
added jacoco injections
2025-03-27 14:05:17 +01:00
19729fae7c
wrote better code
2025-03-26 14:09:21 +01:00
be34e9d2f6
put check for bot as first and apparently that
...
sped up the process quite a bit, nice
2025-03-26 14:04:14 +01:00
99d8abb7e7
moved some lines around to defer responsability
2025-03-26 13:43:33 +01:00
b02f08452d
instead of making two passes through the PRs, we
...
just do one, and process it if it's good
2025-03-26 13:42:41 +01:00
9a48413ead
added line for completness even tho it will never
...
be executed
2025-03-26 13:40:31 +01:00
0d8b81054d
formatted code
2025-03-26 13:05:05 +01:00
fa3b7f82a1
comverted code to have instead of one comment,
...
have a list of them
2025-03-26 12:41:46 +01:00
77c5e2bec0
removed code pertaining to stats, now only relying
...
on cache
2025-03-26 09:05:49 +01:00
e685e62341
improved the lazy loading from cache even more
2025-03-23 13:26:23 +01:00
1aa42c32cd
improved caching
2025-03-23 10:54:43 +01:00
2ba71d8523
renamed variable because its better
2025-03-23 09:58:30 +01:00
dc897ac375
first draft of using cache to resume progress
2025-03-23 09:52:43 +01:00
1a3714422d
added try catch for unexepected exception
2025-03-23 09:27:38 +01:00
282f29520b
since there are multiple jacoco.xml files
...
possible and it's too hard to understand which one
is the correct one for the class, I just log in
the coverages each one that have the fully
qualified class that is commented, it will then be
up to us to filter out what's needed
2025-03-21 13:44:58 +01:00