INDEX
Explanations
mentions of the name "Johnson"
occurrences of the term "honor" and its variations
New Auto-Interp
Negative Logits
Annotations
-0.74
Threat
-0.74
âĹ¼
-0.73
Printed
-0.70
Mutant
-0.70
Ange
-0.67
Assembly
-0.66
Sorceress
-0.66
Zot
-0.64
SCP
-0.63
POSITIVE LOGITS
olulu
1.10
esty
1.00
orable
1.00
hon
0.99
anced
0.99
eteenth
0.96
onen
0.96
lon
0.95
imaru
0.94
eous
0.93
Activations Density 0.010%