Here's an idea/question/postulation to consider...
Is it possible to reasonably classify malware based on the opcodes a binary utilizes as the input attributes to a bayesian classifier? Put another way for the less technical readership, is it possible to determine if a given program (binary) is good or bad based on the internal mechanics of that program when we compare it to the internal mechanics of other programs that are good and bad?

