INDEX
Explanations
references to the pronoun "it" in various contexts
New Auto-Interp
Negative Logits
Geplaatst
-0.79
Lähteet
-0.78
ویکیپدی
-0.73
autorytatywna
-0.72
мәкал
-0.69
cherchés
-0.67
Gost
-0.67
췄
-0.66
referenties
-0.65
Przypisy
-0.64
POSITIVE LOGITS
was
0.75
is
0.73
would
0.71
He
0.70
They
0.66
held
0.66
It
0.64
بأنه
0.62
will
0.61
has
0.60
Activations Density 0.347%