INDEX
Explanations
phrases related to actions or behaviors within a story or narrative
punctuated statements or clauses, particularly those indicating disagreement or contrast
New Auto-Interp
Negative Logits
¬¼
-0.71
UF
-0.64
pec
-0.61
ety
-0.60
orn
-0.59
uy
-0.59
Accessory
-0.59
heast
-0.59
lf
-0.58
interstitial
-0.58
POSITIVE LOGITS
however
1.32
though
1.23
albeit
1.20
although
1.12
huh
1.03
but
1.03
meanwhile
0.97
whereas
0.94
namely
0.92
insofar
0.90
Activations Density 0.939%