INDEX
Explanations
conjunctions used to connect ideas or phrases in a sentence
Negative Logits
SourceFile   -0.80
ULAR         -0.62
cott         -0.61
hari         -0.60
bound        -0.60
CLOSE        -0.58
tnc          -0.57
ories        -0.57
ophone       -0.56
mates        -0.54
Positive Logits
alas          0.80
beware        0.73
nonetheless   0.72
anecd         0.71
unlike        0.70
concedes      0.70
ignores       0.69
concede       0.69
══            0.66
rhet          0.66
Activations Density 0.046%