INDEX
Explanations
information related to political elections and sporting events
occurrences of the word "in" within various contexts
New Auto-Interp
Negative Logits
LU
-0.72
linger
-0.69
REDACTED
-0.68
rang
-0.67
BILITIES
-0.65
ior
-0.63
awei
-0.62
ÑĮ
-0.62
ouf
-0.61
çīĪ
-0.60
POSITIVE LOGITS
spite
1.11
favor
1.04
terms
0.97
favour
0.95
comparison
0.88
lieu
0.87
escap
0.86
vitro
0.86
succession
0.86
front
0.84
Activations Density 0.200%