INDEX
Explanations
repeated phrases or patterns in text
New Auto-Interp
Negative Logits
醐
-0.67
{{$-0.63
{{$-0.60
gül
-0.59
Bisch
-0.58
ად
-0.57
point
-0.56
@@@@@@@@
-0.56
labelledby
-0.55
="{{$-0.55
POSITIVE LOGITS
...
2.96
....
2.33
…
2.32
!...
2.13
...
2.11
..."
2.11
(...
2.09
.....
2.08
,...
2.06
...)
2.06
Activations Density 0.136%