INDEX
Explanations
instances of interviews and related formats
Negative Logits
anson     -0.17
едини     -0.15
Sug       -0.15
esen      -0.15
onda      -0.15
((__      -0.14
porad     -0.13
è«        -0.13
gan       -0.13
ottes     -0.13
Positive Logits
392         0.17
icode       0.15
714         0.15
691         0.14
interviews  0.14
olem        0.14
üstü        0.14
LENG        0.14
ingleton    0.14
åıĸ         0.14
Activations Density: 0.030%