INDEX
Explanations
references to scientific publications and numerical data
New Auto-Interp
Negative Logits
[{
-0.61
StructEnd
-0.57
Xna
-0.47
Livre
-0.47
jock
-0.47
Entrega
-0.46
prek
-0.46
karna
-0.45
punish
-0.45
prefixer
-0.44
POSITIVE LOGITS
ViewImports
0.75
Rüyada
0.69
palsu
0.67
Beware
0.63
disambiguazione
0.61
FALSE
0.61
bogus
0.60
Beware
0.60
falsos
0.58
fakes
0.58
Activations Density 0.197%