INDEX
Explanations
codes or acronyms related to specific organizations or places
references to specific organizations or entities indicated by "DF."
New Auto-Interp
Negative Logits
sole
-0.78
Metatron
-0.70
stakes
-0.66
bell
-0.66
amen
-0.64
bells
-0.63
Answer
-0.62
comes
-0.60
swick
-0.60
frames
-0.60
POSITIVE LOGITS
DF
0.95
ried
0.94
erm
0.93
rost
0.92
avorite
0.92
WD
0.90
amily
0.89
actory
0.89
sg
0.87
iants
0.86
Activations Density 0.016%