INDEX
Explanations
phrases indicating importance or significance
the repeated use of the word "and" in various contexts
Negative Logits
Token         Logit
LOCK          -0.72
YP            -0.71
Ĥİ            -0.70
Shut          -0.69
uta           -0.66
STE           -0.66
è»            -0.65
Cub           -0.64
lace          -0.63
Els           -0.62
Positive Logits
Token         Logit
hence         1.40
consequently  1.35
therefore     1.35
thus          1.23
secondly      1.13
furthermore   1.04
thereby       1.02
moreover      1.00
then          0.94
although      0.93
Activations Density 0.670%