INDEX
Negative Logits
advers
-0.08
controversy
-0.08
appellate
-0.08
shirk
-0.08
TERY
-0.08
steadfast
-0.07
entered
-0.07
ENDED
-0.07
conclusion
-0.07
ker
-0.07
POSITIVE LOGITS
installing
0.10
ightly
0.08
scripts
0.08
<!
0.08
-maker
0.08
Script
0.08
Maker
0.07
Scripts
0.07
linking
0.07
'''↵↵
0.07
Activations Density 0.002%