INDEX
Explanations
references to academic paper sections or citation formats
New Auto-Interp
Negative Logits
Trost
-0.56
外部連結
-0.54
compro
-0.51
znam
-0.49
trust
-0.48
ịnh
-0.48
Drey
-0.46
phyr
-0.46
tay
-0.46
-------------</
-0.45
POSITIVE LOGITS
pp
3.42
pp
2.34
Pp
2.05
PP
2.01
PP
1.86
Pp
1.74
ppc
1.10
pages
1.09
Pages
1.06
pgs
1.01
Activations Density 0.045%