INDEX
Explanations
phrases related to self-defense
New Auto-Interp
Negative Logits
XIII
-0.82
Ashe
-0.81
GOODMAN
-0.72
ICAN
-0.71
Nights
-0.70
Seasons
-0.66
IUM
-0.66
Orchestra
-0.65
BST
-0.65
mingham
-0.64
POSITIVE LOGITS
destruct
1.32
same
1.06
contained
1.02
conscious
0.99
destruct
0.98
absor
0.95
proclaimed
0.94
awareness
0.94
esteem
0.93
esteem
0.93
Activations Density 1.535%