INDEX
Explanations
repeated phrases or patterns in the text
Followed by underscores or whitespace
formatting separators
New Auto-Interp
Negative Logits
كومونز
-0.83
!*\
-0.82
Linus
-0.78
estimés
-0.73
ệc
-0.73
BoxFit
-0.72
Morfologia
-0.71
nią
-0.70
toHaveBeen
-0.70
">—
-0.70
POSITIVE LOGITS
................
1.16
________________
0.90
----------------
0.82
################
0.81
0.75
================
0.69
../../
0.69
…………………………………………
0.66
………………………………
0.65
0.64
Activations Density 0.491%