INDEX
Explanations
occurrences of the word "we"
Negative Logits
uxxxx             -0.64
I                 -0.64
⃣                 -0.62
️⃣                 -0.59
سكانية            -0.58
CastException     -0.58
INCREF            -0.57
längerung         -0.57
matchCondition    -0.54
It                -0.54
Positive Logits
we                2.73
WE                1.98
they              1.07
WE                1.07
welijk            0.92
weh               0.88
awe               0.79
wea               0.75
мы                0.73
wey               0.73
Activations Density 0.075%