We should look into some heuristics to detect spam better. We could look for repeated words, levenshtein distance, message length, excessive usage of non BMP characters, maybe a rudimentary language filter looking for offensive words.
It would also be extremely beneficial if we could tie the detection to a shared backend between all servers so cross server spam is noticed too.