Explanations
verbs related to taking action or making decisions
Negative Logits
Token      Logit
heid       -0.68
Lauder     -0.66
guiIcon    -0.62
icity      -0.60
ivity      -0.59
Domain     -0.58
CTV        -0.57
orial      -0.56
Plain      -0.56
ina        -0.55
Positive Logits
Token      Logit
up         1.14
up         1.07
ups        1.03
out        0.98
down       0.97
off        0.96
away       0.91
down       0.90
UP         0.88
Up         0.86
Activations Density: 2.272%