INDEX
Explanations
occurrences of the word "our" and related possessive pronouns
New Auto-Interp
Negative Logits
978
-0.17
ohana
-0.16
åij³
-0.15
anki
-0.14
lect
-0.14
deo
-0.14
olley
-0.14
VELO
-0.14
ấm
-0.14
ilo
-0.14
POSITIVE LOGITS
ITA
0.15
iner
0.15
jer
0.14
uros
0.14
aber
0.14
OI
0.14
isher
0.14
ocrates
0.13
ABI
0.13
illis
0.13
Activations Density 0.038%