INDEX
Explanations
phrases or sentences introducing additional information or details
references to additional information or topics
New Auto-Interp
Negative Logits
Consolid
-0.77
CHAT
-0.73
Jinn
-0.69
Inher
-0.68
Peaks
-0.67
Bers
-0.67
Kazakh
-0.65
Glover
-0.65
Story
-0.63
Kas
-0.62
POSITIVE LOGITS
than
1.14
than
1.06
efficient
0.87
fficient
0.79
ptive
0.79
ensed
0.78
ocations
0.77
erous
0.75
culated
0.74
venient
0.73
Activations Density 0.723%