INDEX
Explanations
phrases related to authority or oversight
repeated conjunctions or phrases emphasizing connections between ideas
New Auto-Interp
Negative Logits
DX
-0.87
Ĭ±
-0.76
BOOK
-0.73
shift
-0.73
NVIDIA
-0.73
Tes
-0.70
Ĥİ
-0.70
busters
-0.69
---------
-0.68
Times
-0.68
POSITIVE LOGITS
consequently
1.22
furthermore
1.14
thus
1.08
therefore
1.05
moreover
1.03
thence
1.03
thereby
0.99
assorted
0.98
secondly
0.97
hence
0.96
Activations Density 0.295%