INDEX
Explanations
phrases related to expressing different viewpoints or arguments
instances of uncertainty or caution in statements
Negative Logits
Token                 Logit
STATE                 -0.74
roup                  -0.67
=#                    -0.60
guiActiveUnfocused    -0.60
MpServer              -0.60
awa                   -0.59
ews                   -0.59
bris                  -0.59
../                   -0.58
fty                   -0.58
POSITIVE LOGITS
Token             Logit
however           1.54
moreover          1.38
alas              1.31
meanwhile         1.23
incidentally      1.21
though            1.13
therefore         1.07
unsurprisingly    1.06
huh               1.04
according         1.04
Activations Density: 0.388%
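
To show how a density figure of this kind could be derived, here is a minimal Python sketch. It assumes that activation density means the fraction of token positions on which the feature's activation is above zero. The page does not state that definition, and the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" counts a position as active when its activation
    is strictly greater than `threshold` (0.0 by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1000 token positions, 4 of which fire.
sample = [0.0] * 996 + [0.8, 1.2, 0.3, 2.1]
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.400%"
```

Under this reading, a density of 0.388% would mean the feature fires on roughly 4 of every 1000 tokens sampled.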