INDEX
Explanations
the presence of the word "in" used frequently
New Auto-Interp
Negative Logits
་་
-0.89
Chriftian
-0.83
itſelf
-0.80
Hadrian
-0.80
Monfieur
-0.79
Hopf
-0.79
―――――
-0.79
purpoſe
-0.77
Anſ
-0.77
Aphrodite
-0.76
POSITIVE LOGITS
the
1.19
in
1.15
IN
1.14
In
1.09
isIn
0.91
a
0.90
accordance
0.89
lieu
0.86
In
0.84
Dalam
0.84
Activations Density 1.106%