`PMD` Comments Removal #889

Malmahrouqi3 · 2025-06-16T21:11:51Z

Description

This is subsequent to (#882) with the mere addition of filtering out inline comments and commented lines. For the PMD details, check out the original issue (#646)

To show off the difference:
Filter-off: pmd-old branch https://github.com/Malmahrouqi3/MFC-mo2/actions/runs/15693861383/job/44214858621
Filter-on: pmd-new branch https://github.com/Malmahrouqi3/MFC-mo2/actions/runs/15694477828/job/44216673771

Improves PMD.

.github/workflows/pmd.yml

sbryngelson · 2025-06-16T22:13:14Z

.github/workflows/pmd.yml

+                  else
+                      # Overwrite the original file with the processed content
+                      mv "$TMP_FILE" "$file"
+                      echo -e "Successfully processed $file"


Malmahrouqi3 · 2025-06-16T22:15:22Z

@sbryngelson the number of violations is identical (63) whether the filter is on or off.

sbryngelson · 2025-06-16T22:20:27Z

strange

Malmahrouqi3 · 2025-06-17T22:08:35Z

I guess it should now presumably detect more duplicate lines if they are exactly the same operation but split out into lines differently.

q_sf(j, k, l) = q_sf(j, k, l)+q_prim_vf0(mom_idx%beg)%sf(j, k, l)*fd_coeff_x(r, j)*q_prim_vf0(mom_idx%beg)%sf(r+j, k, l)+q_prim_vf0(mom_idx%beg+1)%sf(j, k, l)*fd_coeff_y(r, k)*q_prim_vf0(mom_idx%beg)%sf(j, r+k, l)+q_prim_vf0(mom_idx%end)%sf(j, k, l)*fd_coeff_z(r, l)*q_prim_vf0(mom_idx%beg)%sf(j, k, r+l)/y_cc(k)

sbryngelson · 2025-06-17T22:19:44Z

yes that's what i was thinking. you could even strip spaces (maybe?)

sbryngelson · 2025-06-17T22:19:57Z

this is actually a very valuable tool!

Malmahrouqi3 · 2025-06-17T22:47:34Z

I left behind = .or. .and. and few subtle things.
Other than that, all spaces should be taken off around math/comparison operators, inside indexing parentheses and brackets.

sbryngelson · 2025-06-17T22:52:14Z

I left behind = .or. .and. and few subtle things. Other than that, all spaces should be taken off around math/comparison operators, inside indexing parentheses and brackets.

yes agreed. so you already did this or not yet?

Malmahrouqi3 · 2025-06-17T22:54:01Z

yup, you can check out the last commit PMD check.

Malmahrouqi3 · 2025-06-17T22:55:11Z

Filter Full Implementation

                  sed -E '
                    # First handle & continuation style (modern Fortran)
                    :ampersand_loop
                    /&[[:space:]]*$/ {
                      N
                      s/&[[:space:]]*\n[[:space:]]*(&)?/ /g
                      tampersand_loop
                    }

                    # Handle fixed-form continuation (column 6 indicator)
                    :fixed_form_loop
                    /^[[:space:]]{0,5}[^[:space:]!&]/ {
                      N
                      s/\n[[:space:]]{5}[^[:space:]]/ /g
                      tfixed_form_loop
                    }

                    # Remove any remaining continuation markers
                    s/&//g

                    # Normalize spacing - replace multiple spaces with single space
                    s/[[:space:]]{2,}/ /g

                    # Remove spaces around mathematical operators
                    s/[[:space:]]*\*[[:space:]]*/*/g
                    s/[[:space:]]*\+[[:space:]]*/+/g
                    s/[[:space:]]*-[[:space:]]*/-/g
                    s/[[:space:]]*\/[[:space:]]*/\//g
                    s/[[:space:]]*\*\*[[:space:]]*/\*\*/g

                    # Remove spaces in common Fortran constructs (array indexing, function calls)
                    s/\([[:space:]]*([^,)[:space:]]+)[[:space:]]*,/(\1,/g      # First argument
                    s/,[[:space:]]*([^,)[:space:]]+)[[:space:]]*,/,\1,/g       # Middle arguments
                    s/,[[:space:]]*([^,)[:space:]]+)[[:space:]]*\)/,\1)/g      # Last argument
                    s/\([[:space:]]*([^,)[:space:]]+)[[:space:]]*\)/(\1)/g     # Single argument

                    # Remove spaces around brackets and parentheses
                    s/\[[[:space:]]*/</g
                    s/\[[[:space:]]*/>/g
                    s/\[[[:space:]]*/</g
                    s/[[:space:]]*\]/]/g
                    s/\([[:space:]]*/(/g
                    s/[[:space:]]*\)/)/g

                    # Remove spaces around comparison operators
                    s/[[:space:]]*<=[[:space:]]*/</g
                    s/[[:space:]]*>=[[:space:]]*/>/g
                    s/[[:space:]]*<[[:space:]]*/</g
                    s/[[:space:]]*>[[:space:]]*/>/g
                    s/[[:space:]]*==[[:space:]]*/==/g

                    # Remove full-line comments
                    /^\s*!/d
                    /^[cC*dD]/d
                    /^[ \t]*[cC*dD]/d

                    # Remove end-of-line comments, preserving quoted strings
                    s/([^"'\''\\]*("[^"]*")?('\''[^'\'']*'\''?)?[^"'\''\\]*)[!].*$/\1/
                  ' "$file" > "$TMP_FILE"

sbryngelson · 2025-06-18T03:03:23Z

i just realized that once you have removed any line continuations, you can delete all blank lines and spaces in the source code... they don't actually mean anything and the code doesn't have to compile, it just has to be parsed by PMD. by doing this we ensure we find all duplicate patterns

Malmahrouqi3 · 2025-06-18T07:43:12Z

Yup yup, it is kinda reasonable to think this way if you would rather unveil longer patterns than just ones with few lines.

sbryngelson · 2025-06-18T14:17:19Z

Yup yup, it is kinda reasonable to think this way if you would rather unveil longer patterns than just ones with few lines.

cool should be easier to do this as well. don't see a new commit yet but will look at the PR once it's updated. I did notice that the current run appears to have shorter "worst" offenders at the top of PMD output (fewer tokens). is this because you stripped their whitespace / comments and the code is the same as it was before (so # tokens dropped but the Fortran part is the same)? or is it not finding the worst offenders it was a few commits ago? I remember seeing > 200 tokens as the worst cases

Edit: I checked myself. The violations (the first few worst ones) are the same, so you didn't break anything 👍 .

Malmahrouqi3 · 2025-06-18T15:04:46Z

Added to remove blank lines
/^[[:space:]]*$/d

sbryngelson · 2025-06-18T18:46:36Z

Looks pretty good to me. Do you have any other modifications planned?

Malmahrouqi3 · 2025-06-18T18:48:50Z

Nope

Co-authored-by: mohdsaid497566 <[email protected]>

Malmahrouqi3 and others added 23 commits June 12, 2025 16:57

integrated pmd into CI (MFlowCode#646)

de3a040

create rulset file

a1fe811

corrected directory

8defa9a

changed ruleset pattern typo

1fde2bc

added rules to python and fortran

0332cf1

ruleset for py

9f46e71

individual rules

8c3fb08

java rules - errorprone

cd8e2a5

java rules

4db4277

old school integration of PMD into workflow

d6d3bc8

removed Detect File Changes

54a6fc9

changed to cat to display reports

515c32a

added java compiler as dependency

4f4134a

removed something

b3ae8fa

just checking syntax

085eaa5

set env var pmd=/pmd/bin/pmd

e3626f6

quick syntax correction

f022a85

made PMD_COMMAND globally recognized

4fdb0c3

corrected package path

c7c1bdb

moved alias command under Running PMD

393d69b

comments removal

7448609

comments removal 2

36fb29e

comments removal 3

76352dd

Malmahrouqi3 requested a review from sbryngelson as a code owner June 16, 2025 21:11

Malmahrouqi3 force-pushed the CI-pmd branch from 9e425be to 76352dd Compare June 16, 2025 21:52

sbryngelson reviewed Jun 16, 2025

View reviewed changes

.github/workflows/pmd.yml Show resolved Hide resolved

sbryngelson reviewed Jun 16, 2025

View reviewed changes

Malmahrouqi3 force-pushed the CI-pmd branch from 4ca8da1 to d355280 Compare June 16, 2025 22:20

Malmahrouqi3 and others added 9 commits June 17, 2025 16:13

Update pmd.yml

d4b77fb

Update pmd.yml

502ab2c

Merge branch 'master' into CI-pmd

23d44d3

Update pmd.yml

31b873c

Update pmd.yml

8cf02c0

Update pmd.yml

1008246

Update pmd.yml

cd8f908

more cleanup

ae4cbf5

tokens=20

27c9932

strip out majority of spaces

3c0682d

Malmahrouqi3 added 3 commits June 18, 2025 10:43

Update pmd.yml

d5f882e

Update pmd.yml

0806ba9

Update pmd.yml

972ae38

Merge branch 'master' into CI-pmd

f8527c2

sbryngelson merged commit 82b2dea into MFlowCode:master Jun 18, 2025
18 checks passed

Malmahrouqi3 mentioned this pull request Jun 19, 2025

Cleaned up two echo's off PMD.yml #894

Merged

prathi-wind pushed a commit to prathi-wind/MFC-prathi that referenced this pull request Jul 13, 2025

Improve PMD use (MFlowCode#889)

efa5465

Co-authored-by: mohdsaid497566 <[email protected]>

PMD Comments Removal #889

PMD Comments Removal #889

Uh oh!

Conversation

Malmahrouqi3 commented Jun 16, 2025 • edited by sbryngelson Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

Uh oh!

sbryngelson Jun 16, 2025

Choose a reason for hiding this comment

Uh oh!

Malmahrouqi3 commented Jun 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sbryngelson commented Jun 16, 2025

Uh oh!

Malmahrouqi3 commented Jun 17, 2025

Uh oh!

sbryngelson commented Jun 17, 2025

Uh oh!

sbryngelson commented Jun 17, 2025

Uh oh!

Malmahrouqi3 commented Jun 17, 2025

Uh oh!

sbryngelson commented Jun 17, 2025

Uh oh!

Malmahrouqi3 commented Jun 17, 2025

Uh oh!

Malmahrouqi3 commented Jun 17, 2025

Uh oh!

sbryngelson commented Jun 18, 2025

Uh oh!

Malmahrouqi3 commented Jun 18, 2025

Uh oh!

sbryngelson commented Jun 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Malmahrouqi3 commented Jun 18, 2025

Uh oh!

sbryngelson commented Jun 18, 2025

Uh oh!

Malmahrouqi3 commented Jun 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

2 participants

`PMD` Comments Removal #889

`PMD` Comments Removal #889

Malmahrouqi3 commented Jun 16, 2025 •

edited by sbryngelson

Loading

Malmahrouqi3 commented Jun 16, 2025 •

edited

Loading

sbryngelson commented Jun 18, 2025 •

edited

Loading