INDEX
Explanations
Latin characters adjacent to each other
instances of critical or misleading information
New Auto-Interp
Negative Logits
oun
-0.76
hement
-0.72
xual
-0.71
destro
-0.64
itsu
-0.63
Leilan
-0.63
uto
-0.63
owicz
-0.62
honoured
-0.62
tsky
-0.62
POSITIVE LOGITS
¶
0.85
database
0.84
·
0.78
ccording
0.78
é¾
0.76
³³³³³³³³
0.75
³³³³³³³³³³³³³³³³
0.75
natureconservancy
0.73
yet
0.71
JUST
0.71
Activations Density 0.252%