Commit Graph

  • 69295801ac entries in the dataset now store what build system they use Karma Riuk 2025-03-17 15:04:33 +01:00
  • d7b3d62c0c generating and checking the coverage with jacoco Karma Riuk 2025-03-17 11:29:07 +01:00
  • 47763f5803 when we are done processing the repo, we check it out to the main/master branch Karma Riuk 2025-03-17 10:49:37 +01:00
  • f926be95f9 since we cloned the repos with --depth 1, we don't have their history, now i'm ensure it's there Karma Riuk 2025-03-17 10:49:09 +01:00
  • ca6198410f forgot to set the client of the the build_handler Karma Riuk 2025-03-17 10:48:38 +01:00
  • 38b904b898 fixed postfix Karma Riuk 2025-03-17 10:48:31 +01:00
  • 65306af557 now saving the dataset after each successfully stored PR Karma Riuk 2025-03-17 10:48:09 +01:00
  • 79ead075fa removing PRs opened by bots Karma Riuk 2025-03-17 10:39:47 +01:00
  • 2fad232dd2 now using the build handler to check for compilation at the commit of merge Karma Riuk 2025-03-17 09:22:32 +01:00
  • 082f6c6f71 using expections to interrupt the flow of execution instead of having to propagate a boolean Karma Riuk 2025-03-14 16:10:54 +01:00
  • b04d5b410a made postfix better Karma Riuk 2025-03-14 15:23:12 +01:00
  • 42d734877e made better length of dataset Karma Riuk 2025-03-14 15:21:42 +01:00
  • 76b79105ef fixed typing of field Karma Riuk 2025-03-14 15:21:36 +01:00
  • f5275b3c1b making assumption that the entry will be successful, then setting it to false when something goes wrong Karma Riuk 2025-03-14 15:20:59 +01:00
  • 6c6880715e unified the interface of funcitons in utils (made clone return a bool that indicates success of cloning) Karma Riuk 2025-03-14 15:14:48 +01:00
  • 8f854cfa09 extracted function Karma Riuk 2025-03-14 15:03:32 +01:00
  • 4544922165 extracted a function, updated it, and using it in pull_requests Karma Riuk 2025-03-14 14:38:24 +01:00
  • fd82ff5128 moved github logging to file Karma Riuk 2025-03-14 14:07:31 +01:00
  • 8e1bbd15ff removed useless paramater Karma Riuk 2025-03-14 14:04:50 +01:00
  • cc8cb7ef52 extracted function to utils Karma Riuk 2025-03-14 14:04:41 +01:00
  • 5a2865a78d minor stuff Karma Riuk 2025-03-14 14:03:04 +01:00
  • 4b06d4920b removed tqdm because it was too distracting Karma Riuk 2025-03-14 14:02:38 +01:00
  • c1c2106a86 made the has_only_1_comment function potentially verbose Karma Riuk 2025-03-14 12:06:03 +01:00
  • 9982e2bcc9 added the diff class to dataset entries Karma Riuk 2025-03-14 12:05:39 +01:00
  • 6437efaa54 first clean version of storing the entries of the dataset, still need to check for tests Karma Riuk 2025-03-14 12:05:19 +01:00
  • 468f793ded added the possibility of having the lenght of the dataset Karma Riuk 2025-03-14 11:59:37 +01:00
  • 242d7f6a77 made some entries to the dataset default to "" Karma Riuk 2025-03-14 11:59:21 +01:00
  • d845ce9171 removed sorting of the commit dates because it isn't useful Karma Riuk 2025-03-14 10:07:07 +01:00
  • 8f4614f71a removed useless import Karma Riuk 2025-03-14 09:55:11 +01:00
  • c035bfbe77 updated stats_pull_requests to use the new utils interface Karma Riuk 2025-03-14 09:49:50 +01:00
  • 115e275479 forgot to actually check whether the ammount of comments was 1, now it's fixed Karma Riuk 2025-03-14 09:48:57 +01:00
  • 49e8e29e7a updated the type annotations for the utils functions, much better now Karma Riuk 2025-03-14 09:48:38 +01:00
  • 9f8884a1e9 created the dataclasses for the "triplets" Karma Riuk 2025-03-14 09:26:28 +01:00
  • 873f712640 forgot an import Karma Riuk 2025-03-13 14:25:25 +01:00
  • 536e6e3e3f extracted some functions to a utils file, since they will be usefull for the new pull_requests.py Karma Riuk 2025-03-13 11:31:27 +01:00
  • 3a1b24dcb3 actually the file shouldn't be called 'good_repos.csv' because there aren't only good repos, so calling it 'repos.csv' Karma Riuk 2025-03-13 11:25:13 +01:00
  • f8568fe8ec cleared pull_requests.py to start anew Karma Riuk 2025-03-13 11:11:21 +01:00
  • 02d6c035fd renamed the results file, because "results.csv" was too generic Karma Riuk 2025-03-13 11:09:10 +01:00
  • e69e81e5b3 minor fixes Karma Riuk 2025-03-12 10:29:51 +01:00
  • 8c1de40a1f fixed the way to check whether a pr had only one comment Karma Riuk 2025-03-11 14:27:18 +01:00
  • 7c7b8801eb switched lines because it made more sense Karma Riuk 2025-03-11 14:23:30 +01:00
  • f76cd6f36f added entry to exclusion list Karma Riuk 2025-03-11 10:26:44 +01:00
  • a22776a96c fixed the stdout and file logging issue Karma Riuk 2025-03-10 11:29:09 +01:00
  • 5416b2ba15 saving the stats at each repo, so that we can see the progress Karma Riuk 2025-03-10 10:42:56 +01:00
  • fc0fdd2e43 added entry to exclusion list Karma Riuk 2025-03-10 10:39:13 +01:00
  • 8b5bae858e getting stats for repos that actually have tests Karma Riuk 2025-03-10 10:38:56 +01:00
  • 500531c8b0 apparently some dates are now parsed correctly when the json is parsed, but others are still in string form, so i'm trying to account for that Karma Riuk 2025-03-10 10:28:29 +01:00
  • aa5ed85717 made better post fix Karma Riuk 2025-03-10 10:28:13 +01:00
  • 527f9ea1af made logging better to avoid messing up the progress bar Karma Riuk 2025-03-10 10:27:43 +01:00
  • d34b3dbb20 made stats for PRs better, because i was not considering all edge cases Karma Riuk 2025-03-10 09:54:21 +01:00
  • 3a5cd998fd added the one hour limit to compilation as well Karma Riuk 2025-03-07 08:59:50 +01:00
  • 9fa7dd53af now using the PyGithub library, so that it handles for me the issue of too many requests in too little time Karma Riuk 2025-03-06 16:36:53 +01:00
  • 57b0f0c2cd first draft of getting the stats for pull requests Karma Riuk 2025-03-06 16:26:33 +01:00
  • 614115123e can now process repo (still needs testing) Karma Riuk 2025-03-06 15:39:04 +01:00
  • 3cbb81894d removed useless stuff Karma Riuk 2025-03-06 10:09:21 +01:00
  • a983e2c122 extracting the file before, comment and file after for the comments in a PR Karma Riuk 2025-03-06 10:01:59 +01:00
  • be1be25131 fixed lazy loading Karma Riuk 2025-03-06 09:53:36 +01:00
  • fd022ae8bb made the lazy loading better Karma Riuk 2025-03-05 12:12:15 +01:00
  • 470856c1a7 created a timeout to stop tests from running when they take ages Karma Riuk 2025-03-05 11:57:20 +01:00
  • 6857dbcff9 removed the count with grep since it's useless Karma Riuk 2025-03-05 11:53:22 +01:00
  • d9c100a811 finally get the overlapping commits and comments Karma Riuk 2025-03-05 10:22:55 +01:00
  • 9ab2036bc7 idk how to name variables Karma Riuk 2025-03-05 09:57:55 +01:00
  • ab88018f16 renamed variable Karma Riuk 2025-03-05 09:56:13 +01:00
  • b9501f9aac compacted code Karma Riuk 2025-03-05 09:55:19 +01:00
  • cb7c1a94a1 augmenting the comments with the range of the hunk, useful for later to search for overlap between comment and commit Karma Riuk 2025-03-05 09:54:53 +01:00
  • 9122b27140 renamed function Karma Riuk 2025-03-05 09:48:23 +01:00
  • a0ac5b47ad actually using the extracted function Karma Riuk 2025-03-05 09:30:04 +01:00
  • 788f8db974 fixed the extraction of the date, because on the server, datetime.fromisoformat couldn't be parsed for some reason Karma Riuk 2025-03-05 09:29:03 +01:00
  • ceb15674ad first version of the pull_requests.py Karma Riuk 2025-03-05 09:22:02 +01:00
  • 65b7a6423f made postfix better Karma Riuk 2025-03-05 09:08:16 +01:00
  • 67bbd50387 updated maven test extraction handler Karma Riuk 2025-03-02 23:32:31 +01:00
  • 0c802aca1c made gradle test extraction more robust and less prone to failures (exceptions) Karma Riuk 2025-03-02 23:31:49 +01:00
  • fd3c99001a updated comment Karma Riuk 2025-03-02 15:52:15 +01:00
  • ec216e802a removed useless import Karma Riuk 2025-03-02 12:03:08 +01:00
  • 351caac513 fixed lazy processing Karma Riuk 2025-03-02 12:01:43 +01:00
  • 02ec9b667a removed useless comment Karma Riuk 2025-03-02 11:01:53 +01:00
  • b1d98df292 implemented lazy evaluation (ignore repo if already processed) Karma Riuk 2025-03-02 11:01:12 +01:00
  • fb220eac1e added entry to exclusion list Karma Riuk 2025-03-02 10:33:20 +01:00
  • 9f504bde78 renamed clone repos function because it was a bit out of date Karma Riuk 2025-03-02 10:32:51 +01:00
  • 3d33cc941b made postfix better Karma Riuk 2025-03-01 23:12:49 +01:00
  • cbdf9db7a9 updated requirements Karma Riuk 2025-03-01 23:00:58 +01:00
  • 2a59fdf76c now extracting the number of tests for gradle as well Karma Riuk 2025-03-01 22:55:54 +01:00
  • e74aa5042e fixed division by zero Karma Riuk 2025-03-01 19:50:29 +01:00
  • 3740c89ed6 added extraction of number of tests (both from output and grep) Karma Riuk 2025-03-01 18:29:04 +01:00
  • 177f305d4c made posfix better Karma Riuk 2025-03-01 18:28:28 +01:00
  • a8adf2fb47 Merge branch 'handler-class' Karma Riuk 2025-03-01 17:25:26 +01:00
  • 59055df8b4 added logging of the number of gradle projects in progress bar Karma Riuk 2025-03-01 17:25:06 +01:00
  • 4f5695d8d2 extracted the logic of handling maven or gradle to a simple class structure Karma Riuk 2025-03-01 17:24:12 +01:00
  • 7028b0a5c0 fixed docker files Karma Riuk 2025-03-01 17:04:15 +01:00
  • 29541a15e9 now removing from output also the list of unapproved licenses Karma Riuk 2025-03-01 15:57:28 +01:00
  • c8ec782adf forgot to save when not interrupted Karma Riuk 2025-03-01 15:52:01 +01:00
  • 7b389c87d2 split dockerfile for maven and gradle separately Karma Riuk 2025-03-01 15:32:14 +01:00
  • ff9b4e0cdb excluding build.xml because apparenlty it's not for maven Karma Riuk 2025-03-01 15:05:28 +01:00
  • ec37ee47b9 fixed maven version not being new enough for a lot of projects Karma Riuk 2025-03-01 14:58:35 +01:00
  • 5ac5df42a0 made a better clean of the output Karma Riuk 2025-03-01 14:58:15 +01:00
  • 3ff9ebc7e4 made maven command better Karma Riuk 2025-03-01 14:57:19 +01:00
  • 65e4a45c9a made the "good_repo_for_crab" respect the standard of the other columns Karma Riuk 2025-03-01 12:57:07 +01:00
  • 3f388e3c0c cleaning the output of build / test properly Karma Riuk 2025-03-01 12:56:45 +01:00
  • e6f28f1c23 now logging the build system (i forgot earlier) Karma Riuk 2025-03-01 12:47:18 +01:00
  • 9ac7f9edb2 removed colour from output of gradle and maven Karma Riuk 2025-03-01 12:45:14 +01:00