[BUG] Semantically incorrect feature in Fraud detection notebook #967

@loomlike

Description

Willingness to contribute

Yes. I can contribute a fix for this bug independently.

Feathr version

0.9.0

System information

NA

Describe the problem

The transaction time column in the sample data contains integer values that encode the transaction's time of day in HHmmss format.
Because the column is an integer (not a string), the notebook uses it as a feature as-is and also computes an averaged feature value from it.
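
To illustrate why the raw integer is a poor numeric feature, here is a minimal sketch in plain Python. The two values are made up for illustration and are not taken from the sample data; `to_seconds` is a hypothetical helper, not part of Feathr.

```python
# Two hypothetical transaction times, one second apart, encoded as HHmmss integers.
t1 = 95959   # 09:59:59
t2 = 100000  # 10:00:00

raw_gap = t2 - t1          # distance in the raw integer encoding
raw_mean = (t1 + t2) / 2   # arithmetic mean of the raw integers


def to_seconds(hhmmss: int) -> int:
    """Convert an HHmmss integer to seconds since midnight."""
    hours, rest = divmod(hhmmss, 10000)
    minutes, seconds = divmod(rest, 100)
    return hours * 3600 + minutes * 60 + seconds


true_gap = to_seconds(t2) - to_seconds(t1)

print(raw_gap)   # 4041
print(raw_mean)  # 97979.5
print(true_gap)  # 1
```

The two times are one second apart, yet the integers differ by 4041, and their mean (97979.5) does not correspond to a valid HHmmss value.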

However, to be semantically accurate, the value should be converted to a string and parsed as a time of day.

From a data science point of view, we could use the hour of day as the feature to detect whether a given transaction happened at a "usual" time of day.
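
A rough sketch of that conversion in plain Python follows. It assumes the integer drops leading zeros (so 00:00:05 is stored as `5`), which is why the value is zero-padded before parsing. The function names and sample values are hypothetical; this is not the notebook's code or a Feathr transformation.

```python
from datetime import datetime, time


def parse_hhmmss(value: int) -> time:
    """Parse an HHmmss integer (e.g. 93015 for 09:30:15) into a datetime.time."""
    # Zero-pad to six digits so early-morning values such as 5 (00:00:05) parse correctly.
    return datetime.strptime(f"{value:06d}", "%H%M%S").time()


def hour_of_day(value: int) -> int:
    """Return the hour (0-23) of an HHmmss integer."""
    return parse_hhmmss(value).hour


# Made-up sample values, for illustration only.
samples = [5, 93015, 235959]
hours = [hour_of_day(v) for v in samples]
print(hours)  # [0, 9, 23]
```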

For the aggregation feature, the median makes more sense than the average, but we currently don't support a Med aggregation key.
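
As a small illustration of the difference, the sketch below compares the two aggregations on made-up hour-of-day values using Python's standard library. It only demonstrates the statistical point and does not use any Feathr aggregation API.

```python
from statistics import mean, median

# Hypothetical hour-of-day values for one account:
# mostly mid-morning transactions plus one late-night outlier.
hours = [9, 10, 10, 11, 23]

avg_hour = mean(hours)    # pulled toward the outlier
med_hour = median(hours)  # stays at the typical hour

print(avg_hour)  # 12.6
print(med_hour)  # 10
```

Here the single late-night transaction shifts the average to 12.6, while the median stays at 10, the hour at which this account typically transacts.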

Tracking information

No response

Code to reproduce bug

No response

What component(s) does this bug affect?

  • Python Client: This is the client users use to interact with most of our API. Mostly written in Python.
  • Computation Engine: The computation engine that executes the actual feature join and generation work. Mostly in Scala and Spark.
  • Feature Registry API: The frontend API layer, which supports SQL and Purview (Atlas) as storage. The API layer is in Python (FastAPI).
  • Feature Registry Web UI: The Web UI for the feature registry. Written in React.

Labels

bug (Something isn't working)