INDEX
Explanations
phrases related to taking responsibility or action
phrases related to taking action or responsibility
Negative Logits
覚醒        -0.68
raid        -0.66
orsche      -0.66
pport       -0.65
selage      -0.64
oc          -0.62
QR          -0.61
ores        -0.61
herent      -0.60
Rust        -0.60
Positive Logits
aside       1.07
frog        1.02
forward     1.00
forth       0.97
foot        0.90
up          0.90
ashore      0.89
stones      0.81
down        0.80
into        0.78
Activations Density 0.030%