INDEX
Explanations
instances of the word "like" in various contexts
Negative Logits
eyse          -0.17
istrovství    -0.16
åł            -0.15
atron         -0.14
mony          -0.14
kem           -0.14
ucu           -0.14
roles         -0.14
řed           -0.14
ремя          -0.14
Positive Logits
utta          0.17
ув            0.15
Erf           0.15
ë°            0.15
ček           0.14
Wrest         0.14
vern          0.14
Heller        0.14
Orr           0.13
اط            0.13
Activations Density 0.040%