Jump to content

Finding criteria for a household financial budget falsification


Recommended Posts

Posted (edited)

I’m working on a financial problem about budget of households. Households in a state fill a form about their net budget in every year and our insurance company investigate their financial status and find the exact amount of their budget. The net budget can be positive or negative. I’m designing a system with neural network that we can find the households that Falsify their net budget, So my output is a binary [ 0 and 1] which 1 is false net budget and 0 is true net budget.

Suppose that households’ net budget is P and our investigated outcome of net budget is R. As you know:

Now we have this:

R>=P

If P=R then  :  Criterion = 0 (true net budget)If P>0 and R>0   :  Criterion = R-P/R If P<0 and R<0   :  Criterion = (abs(P)-abs(R)) / abs(P) If P<0 and R>0    :  ????     (A lot of households have this situation and I dint find any criterion for it)If P<0 and R=0    :  ????     (Some of households have this situation and I dint find any criteria for it)If P=0 and R>0    :  ????      (Some of households have this situation and I dint find any criterion for it)

After finding these criterion that must be between 0 and 1 [0 1], I set a threshold (like 0.5), If the criterion is higher than 0.5 household is falsifying the net budget (output = 1) and If it is lower than 0.5 we have a true budget and there isn’t any problem in budget declaring of households (output = 0).

** In range of [0 1], if falsification is higher (depends of above functions - P and R) the creation goes toward 1 and if the household is honest it goes toward 0. So every household (sample) has a criterion between 0 and 1.

My main problem is finding a good criterion when (P<0 , R>0), After that (P<0 , R=0) and finally (P=0 , R>0). IF I have a problem in first, second and third criterion, feel free to say. I think the output of second situation is different from third because structures of criteria is different. I’m going to find a single criterion function If it is possible.

 

 

PS. I Have 2000 samples with different size of P and R.

 

 

Thank you so much for your helps.

Edited by jack_s
Posted

**Example :**
First conduction : P = 10 , R = 15 : Criteria : (15-10/15) = 33.3% ( falsification rate for this household )
Second conduction : P = -16, R = -10 : Criteria : (abs(-16)-abs(-10))/abs(-16) = 37.5% ( falsification rate for this household )

Now if for example P=-20 and R = 40, How we can calculate falsification rate? . This is main question. after that how we can calculate when (P<0 , R=0) and finally (P=0 , R>0)?

Posted (edited)

Is this for a college project or something similar? I know programming and the more crucial bug detection so I'll try to provide what help I can from that perspective.

 

The two examples provided are both claiming to be doing worse than what they are. You might want to try reversing the numbers to account for people claiming to be doing better than they are.

 

ie. P = 15, R = 10

 

You are looking for all your answers to fit into this format:

 

If abs(X) <= abs(Y) and Y != 0, then abs(X/Y) = [0 1]

 

I know

abs(P - R) / (abs(P) + abs(R))

will always give you "something" in that range(except when P=R=0, covered by your first case). I have no idea about how accurate those percentages will be though. May just need more conditionals instead depending on what you are after.

 

Sorry, I couldn't be of more help. We do have quite a few math geniuses here who can likely provide more help than I could though.

Edited by Endy0816

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
×
×
  • Create New...

Important Information

We have placed cookies on your device to help make this website better. You can adjust your cookie settings, otherwise we'll assume you're okay to continue.