Explanations
the word "ara" with high frequency in the text
Negative Logits
Efq             -0.72
IndentedString  -0.72
UpInside        -0.67
StructEnd       -0.66
neſs            -0.65
ſelf            -0.64
itſelf          -0.64
Reſ             -0.64
unſ             -0.63
himſelf         -0.63
Positive Logits
TC        1.03
ara       0.89
TC        0.81
tc        0.76
|         0.66
\{\\      0.66
Observe   0.65
tc        0.64
Observe   0.61
          0.59
Activations Density 0.091%