Explanations
occurrences of the word "is" and variations of "s"
Negative Logits
Token	Logit
الدراسه	-0.61
omnes	-0.56
Bhd	-0.55
جوايز	-0.54
apore	-0.53
ῷ	-0.52
ใหม่	-0.52
Horne	-0.52
깥	-0.51
habet	-0.51
Positive Logits
Token	Logit
Thats	1.00
Thats	0.99
thats	0.91
thats	0.84
שוליים	0.84
That	0.74
ValueGeneration	0.74
مشين	0.73
why	0.70
="@+	0.69
Activations Density: 0.089%