INDEX
Explanations
instances of special characters and formatting tokens in the text
Negative Logits
samar     -0.65
lesssim   -0.64
ne        -0.63
bra       -0.63
model     -0.62
ser       -0.62
elry      -0.62
front     -0.61
ang       -0.61
Levin     -0.60
Positive Logits
varandra     0.88
enumi        0.85
bakgrund     0.84
antaranya    0.81
consultato   0.78
maș          0.78
myö          0.76
häls         0.74
térmico      0.74
vulga        0.73
Activations Density: 0.062%