INDEX
Explanations
short sentences, headlines, or labels at the beginning or end of text segments
occurrences of the word "This" at the beginning of sentences or clauses
New Auto-Interp
Negative Logits
sic
-0.79
srfAttach
-0.76
respons
-0.69
fully
-0.69
actionGroup
-0.68
encour
-0.68
forth
-0.66
versa
-0.65
externalToEVAOnly
-0.64
remem
-0.63
POSITIVE LOGITS
zbollah
0.91
Updated
0.83
Vegan
0.73
anmar
0.72
Transcript
0.70
Wrestling
0.70
Welcome
0.69
Expand
0.69
chwitz
0.69
Update
0.68
Activations Density 0.283%