HADOOP-18056. DistCp: Filter duplicates in the source paths. #3825
9766389 to 565b1f8
🎊 +1 overall
This message was automatically generated.
565b1f8 to a232e11
🎊 +1 overall
This message was automatically generated.
Set<Path> uniquePaths = new LinkedHashSet<>();
for (Path path : srcPaths) {
  if (!uniquePaths.add(path)) {
    LOG.warn("Path: {} added multiple times, Ignoring the redundant entry.", path);
LGTM. Do we need to change "Ignoring" to "ignoring"?
Thanks @tomscut, I have changed it.
🎊 +1 overall
This message was automatically generated.
LGTM.
Set<Path> uniquePaths = new LinkedHashSet<>();
for (Path path : srcPaths) {
  if (!uniquePaths.add(path)) {
    LOG.warn("Path: {} added multiple times, ignoring the redundant entry.", path);
Should it log at INFO? I'm just worried because WARN messages are often interpreted as a sign of serious problems; if this is happening due to some workflow, then it's not going to go away.
Yes, makes sense.
Thanks @steveloughran for the review. I have changed it to INFO.
🎊 +1 overall
This message was automatically generated.
steveloughran left a comment
+1 from me
… Contributed by Ayush Saxena. Reviewed-by: tomscut <[email protected]> Reviewed-by: Steve Loughran <[email protected]>
…3825). Contributed by Ayush Saxena. Reviewed-by: tomscut <[email protected]> Reviewed-by: Steve Loughran <[email protected]>
Description of PR
Filter duplicates from source paths
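The filtering discussed in the review can be sketched in isolation. This is a minimal, self-contained illustration and not the code from the patch: the class name DedupSketch and the method filterDuplicates are hypothetical, plain String stands in for Hadoop's Path, and System.out stands in for the LOG call (which, per the review above, ended up at INFO level). It relies on Set.add returning false for an element that is already present, and on LinkedHashSet preserving insertion order so the first occurrence of each path keeps its position.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

class DedupSketch {
    // Hypothetical helper for illustration; not the method from the patch.
    // Assumption: String is used in place of org.apache.hadoop.fs.Path.
    static List<String> filterDuplicates(List<String> srcPaths) {
        // LinkedHashSet keeps the order in which paths were first seen.
        Set<String> uniquePaths = new LinkedHashSet<>();
        for (String path : srcPaths) {
            // add() returns false when the path is already in the set.
            if (!uniquePaths.add(path)) {
                // Stand-in for the logger call discussed in the review.
                System.out.println("Path: " + path
                    + " added multiple times, ignoring the redundant entry.");
            }
        }
        return new ArrayList<>(uniquePaths);
    }

    public static void main(String[] args) {
        List<String> src = Arrays.asList("/data/a", "/data/b", "/data/a");
        // Logs one message for the repeated "/data/a", then prints the unique paths.
        System.out.println(filterDuplicates(src));
    }
}
```

A plain HashSet would also drop duplicates, but it would not guarantee that the remaining paths stay in the order the user supplied them, which is presumably why the snippet under review uses LinkedHashSet.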
How was this patch tested?
Unit tests (UT).
For code changes: