326 Commits

Author SHA1 Message Date
69295801ac entries in the dataset now store what build system
they use
2025-03-17 15:04:33 +01:00
d7b3d62c0c generating and checking the coverage with jacoco 2025-03-17 11:47:54 +01:00
47763f5803 when we are done processing the repo, we check it
out to the main/master branch
2025-03-17 10:49:37 +01:00
f926be95f9 since we cloned the repos with --depth 1, we don't
have their history, now i'm ensure it's there
2025-03-17 10:49:34 +01:00
ca6198410f forgot to set the client of the the build_handler 2025-03-17 10:48:38 +01:00
38b904b898 fixed postfix 2025-03-17 10:48:31 +01:00
65306af557 now saving the dataset after each successfully
stored PR
2025-03-17 10:48:09 +01:00
79ead075fa removing PRs opened by bots 2025-03-17 10:39:47 +01:00
2fad232dd2 now using the build handler to check for
compilation at the commit of merge
2025-03-17 09:22:32 +01:00
082f6c6f71 using expections to interrupt the flow of
execution instead of having to propagate a boolean
2025-03-14 16:10:54 +01:00
b04d5b410a made postfix better 2025-03-14 15:23:12 +01:00
42d734877e made better length of dataset 2025-03-14 15:21:42 +01:00
76b79105ef fixed typing of field 2025-03-14 15:21:36 +01:00
f5275b3c1b making assumption that the entry will be
successful, then setting it to false when something goes wrong
2025-03-14 15:20:59 +01:00
6c6880715e unified the interface of funcitons in utils (made
clone return a bool that indicates success of cloning)
2025-03-14 15:14:48 +01:00
8f854cfa09 extracted function 2025-03-14 15:10:42 +01:00
4544922165 extracted a function, updated it, and using it in
pull_requests
2025-03-14 14:38:24 +01:00
fd82ff5128 moved github logging to file 2025-03-14 14:07:31 +01:00
8e1bbd15ff removed useless paramater 2025-03-14 14:05:32 +01:00
cc8cb7ef52 extracted function to utils 2025-03-14 14:04:41 +01:00
5a2865a78d minor stuff 2025-03-14 14:03:04 +01:00
4b06d4920b removed tqdm because it was too distracting 2025-03-14 14:02:38 +01:00
c1c2106a86 made the has_only_1_comment function potentially
verbose
2025-03-14 12:06:03 +01:00
9982e2bcc9 added the diff class to dataset entries 2025-03-14 12:05:39 +01:00
6437efaa54 first clean version of storing the entries of the
dataset, still need to check for tests
2025-03-14 12:05:19 +01:00
468f793ded added the possibility of having the lenght of the
dataset
2025-03-14 11:59:37 +01:00
242d7f6a77 made some entries to the dataset default to "" 2025-03-14 11:59:21 +01:00
d845ce9171 removed sorting of the commit dates because it
isn't useful
2025-03-14 10:07:07 +01:00
8f4614f71a removed useless import 2025-03-14 09:55:11 +01:00
c035bfbe77 updated stats_pull_requests to use the new utils
interface
2025-03-14 09:49:50 +01:00
115e275479 forgot to actually check whether the ammount of
comments was 1, now it's fixed
2025-03-14 09:48:57 +01:00
49e8e29e7a updated the type annotations for the utils
functions, much better now
2025-03-14 09:48:38 +01:00
9f8884a1e9 created the dataclasses for the "triplets" 2025-03-14 09:27:51 +01:00
873f712640 forgot an import 2025-03-13 14:25:25 +01:00
536e6e3e3f extracted some functions to a utils file, since
they will be usefull for the new pull_requests.py
2025-03-13 11:31:27 +01:00
3a1b24dcb3 actually the file shouldn't be called
'good_repos.csv' because there aren't only good repos, so calling it 'repos.csv'
2025-03-13 11:25:13 +01:00
f8568fe8ec cleared pull_requests.py to start anew 2025-03-13 11:11:21 +01:00
02d6c035fd renamed the results file, because "results.csv"
was too generic
2025-03-13 11:09:10 +01:00
e69e81e5b3 minor fixes 2025-03-12 11:54:11 +01:00
8c1de40a1f fixed the way to check whether a pr had only one comment 2025-03-11 14:27:18 +01:00
7c7b8801eb switched lines because it made more sense 2025-03-11 14:23:30 +01:00
f76cd6f36f added entry to exclusion list 2025-03-11 10:26:44 +01:00
a22776a96c fixed the stdout and file logging issue 2025-03-10 11:29:09 +01:00
5416b2ba15 saving the stats at each repo, so that we can see
the progress
2025-03-10 10:42:56 +01:00
fc0fdd2e43 added entry to exclusion list 2025-03-10 10:39:13 +01:00
8b5bae858e getting stats for repos that actually have tests 2025-03-10 10:38:56 +01:00
500531c8b0 apparently some dates are now parsed correctly
when the json is parsed, but others are still in string form, so i'm trying to account for that
2025-03-10 10:28:29 +01:00
aa5ed85717 made better post fix 2025-03-10 10:28:13 +01:00
527f9ea1af made logging better to avoid messing up the
progress bar
2025-03-10 10:27:43 +01:00
d34b3dbb20 made stats for PRs better, because i was not
considering all edge cases
2025-03-10 09:54:21 +01:00