INDEX
Explanations
phrases indicating someone is new to a platform or subject
New Auto-Interp
Negative Logits
èĨ
-0.16
MISS
-0.15
acc
-0.14
Barton
-0.14
665
-0.14
mos
-0.14
eut
-0.14
eph
-0.14
monic
-0.14
net
-0.13
POSITIVE LOGITS
bish
0.17
itere
0.15
ürger
0.15
oplevel
0.15
ling
0.14
paddingRight
0.14
ÃŃÅĻ
0.14
ograd
0.14
apons
0.14
ienen
0.14
Activations Density 0.034%