INDEX
Explanations
occurrences of the word "only."
New Auto-Interp
Negative Logits
åıªæĺ¯
-0.17
are
-0.16
lez
-0.16
zan
-0.16
zelf
-0.15
onders
-0.15
antine
-0.14
Ñĸв
-0.14
iare
-0.14
ãĤĤãģªãģĦ
-0.14
POSITIVE LOGITS
fans
0.25
Fans
0.24
íģ¼
0.21
rarely
0.20
partially
0.20
partly
0.17
yyyy
0.17
yyy
0.17
baÅŁÄ±na
0.17
ness
0.17
Activations Density 0.085%