INDEX
Explanations
professional statements or press releases
repeated phrases involving "in a" and "a" within various statements
Negative Logits
Exper           -0.72
models          -0.66
rity            -0.64
experimented    -0.64
killers         -0.64
Mods            -0.63
fights          -0.63
Avenger         -0.62
âī              -0.62
enemy           -0.62
Positive Logits
statement       1.55
memo            1.10
memorandum      1.06
tweet           1.06
letter          1.03
press           1.03
telephone       1.03
blog            1.02
written         1.01
statement       0.99
Activations Density 0.066%