INDEX
Explanations
Short repetitive patterns or phrases within sentences
Occurrences of a specific character or symbol repeated in various contexts
Negative Logits
Appl      -0.75
detail    -0.63
assemb    -0.62
aur       -0.61
expires   -0.60
enroll    -0.59
praying   -0.59
suit      -0.58
Aston     -0.58
apr       -0.57
Positive Logits
ĺ         4.41
ľ         2.00
Ĺ         1.95
Ķ         1.88
Ļ         1.88
Ľ         1.84
ĸ         1.82
ij        1.80
Ŀ         1.79
IJ        1.78
Activations Density 0.041%