Fix bug in normalize_and_hash_email_address function #752

arnau126 · 2023-02-25T16:28:18Z

First commit

If email_address has no "@" in it, the function raises IndexError list index out of range in:

is_gmail = re.match(r"^(gmail|googlemail)\.com$", email_parts[1])

because email_parts is a list with only one item.

This PR fix this bug by moving the above line inside the if-block if len(email_parts) > 1 .

Second commit

I've added strip to email_address to honor the Enhanced Conversions doc which says:

[...] In order to standardize the hash results, prior to hashing one of these values you must:

Remove leading/trailing whitespaces.

[...]

google-cla · 2023-02-25T16:28:21Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

arnau126 · 2023-02-26T10:29:18Z

About the second commit, I've just realized that the function calls normalize_and_hash which already does the strip. However I think that we should keep this extra strip in the very beginning of the function so a string like "[email protected] " (with a trailing whitespace) works.
If we don't perfom this previous strip, the regex won't match, and the period won't be removed.

BenRKarl · 2025-03-13T15:02:11Z

Question - are these changes just to address email address that don't have @ symbols in them?

arnau126 · 2025-03-17T10:30:24Z

@BenRKarl
Yes.

The original function already tries to do so, but in a wrong way:

    email_parts = normalized_email.split("@")
    is_gmail = re.match(r"^(gmail|googlemail)\.com$", email_parts[1])

    # Check that there are at least two segments and the second segment
    # matches the above regex expression validating the email domain name.
    if len(email_parts) > 1 and is_gmail:

I've just moved the is_gmail regex inside the if so it's only calculated if there are at least two segments.

    email_parts = normalized_email.split("@")

    # Check that there are at least two segments
    if len(email_parts) > 1:
        # Checks whether the domain of the email address is either "gmail.com"
        # or "googlemail.com". If this regex does not match then this statement
        # will evaluate to None.
        if re.match(r"^(gmail|googlemail)\.com$", email_parts[1]):

BenRKarl · 2025-04-11T13:31:05Z

@arnau126 ok great, sorry for the delay. Looks like these changes are being added to now-deleted files. Could you move them over to the new files where this validation occurs?

remarketing/upload_enhanced_conversions_for_leads.py
remarketing/upload_enhanced_conversions_for_web.py

…xamples.

…emarketing examples.

arnau126 · 2025-04-11T14:33:11Z

@BenRKarl
Done (and rebased).

arnau126 requested a review from a team as a code owner February 25, 2023 16:28

arnau126 requested review from AnashOommen and BenRKarl February 25, 2023 16:28

arnau126 force-pushed the main branch from eb36da5 to e028390 Compare March 2, 2023 10:19

arnau126 added 2 commits April 11, 2025 16:27

Fix bug in normalize_and_hash_email_address function of remarketing e…

e024c04

…xamples.

Strip email_address in normalize_and_hash_email_address function of r…

f9f5f6d

…emarketing examples.

arnau126 force-pushed the main branch from e028390 to f9f5f6d Compare April 11, 2025 14:32

BenRKarl approved these changes Apr 11, 2025

View reviewed changes

BenRKarl merged commit bbe1466 into googleads:main Apr 11, 2025
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix bug in normalize_and_hash_email_address function #752

Fix bug in normalize_and_hash_email_address function #752

Uh oh!

arnau126 commented Feb 25, 2023

Uh oh!

google-cla bot commented Feb 25, 2023

Uh oh!

arnau126 commented Feb 26, 2023

Uh oh!

BenRKarl commented Mar 13, 2025

Uh oh!

arnau126 commented Mar 17, 2025 •

edited

Loading

Uh oh!

BenRKarl commented Apr 11, 2025

Uh oh!

arnau126 commented Apr 11, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix bug in normalize_and_hash_email_address function #752

Fix bug in normalize_and_hash_email_address function #752

Uh oh!

Conversation

arnau126 commented Feb 25, 2023

First commit

Second commit

Uh oh!

google-cla bot commented Feb 25, 2023

Uh oh!

arnau126 commented Feb 26, 2023

Uh oh!

BenRKarl commented Mar 13, 2025

Uh oh!

arnau126 commented Mar 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

BenRKarl commented Apr 11, 2025

Uh oh!

arnau126 commented Apr 11, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

arnau126 commented Mar 17, 2025 •

edited

Loading