INDEX
Explanations
lines of text starting and ending with specific characters ('Ċ' and '|endoftext|') and containing various criteria or conditions
list-like structures or enumerations in text
New Auto-Interp
Negative Logits
hement
-0.83
senal
-0.79
ataka
-0.71
Thornton
-0.68
emancipation
-0.67
ĸļ
-0.66
lifes
-0.65
aido
-0.63
Guth
-0.61
orate
-0.61
POSITIVE LOGITS
Interstitial
0.90
;;
0.85
Liter
0.82
Introdu
0.79
FIN
0.77
âĸł
0.76
Height
0.76
Ur
0.75
Ingredients
0.75
âľ
0.74
Activations Density 0.230%