Hi all,
I'm currently working on my masterthesis. It's about whether the pandemic has influenced the method of phishing that is done by criminals. I've got a dataset on phishing and I'm currently doing a keyword analysis by how many times a certain keyword comes across in phishing emails on a given day. I've normalized the count by dividing it by the number of phishing emails on that particular day (since the amount of emails per day varies). So, now the data looks like the average count of a given keyword in a email per day.
Now I was wondering on how I can test this statistically, I'm thinking about a paired samples T-test. Where I've created 2 groups: before and after the start of the pandemic. Any suggestions?
Thank you!