INDEX
Explanations
phrases related to contrasting or emphasizing information
repetitive phrases that include the word "this" followed by a variety of common conjunctions or prepositions
New Auto-Interp
Negative Logits
href
-0.58
agall
-0.57
appro
-0.57
GI
-0.54
ga
-0.53
Near
-0.52
pickup
-0.52
abduct
-0.52
gd
-0.51
Names
-0.51
POSITIVE LOGITS
coupled
0.94
notwithstanding
0.80
however
0.74
incidentally
0.73
along
0.72
sadly
0.71
besides
0.71
combined
0.70
alas
0.69
suffice
0.68
Activations Density 0.070%