INDEX
Explanations
mentions of specific locations or people's names
the word "ag" and its context in various usages or forms
Negative Logits
reps      -0.67
Widow     -0.63
Bolt      -0.62
Templ     -0.61
bell      -0.60
mileage   -0.59
spinal    -0.58
LINE      -0.58
Infinite  -0.58
wre       -0.57
POSITIVE LOGITS
ag        4.13
ags       2.35
AG        1.98
agos      1.86
agg       1.84
agging    1.73
agin      1.72
agged     1.67
agy       1.61
agn       1.59
Activations Density 0.017%