INDEX
Explanations
phrases related to actions and intentions of people
connections between actions and consequences
Negative Logits
rique        -0.89
asio         -0.77
fronts       -0.76
rers         -0.66
hoe          -0.65
lance        -0.62
reinstated   -0.62
hei          -0.61
onde         -0.60
holders      -0.60
Positive Logits
namely       0.92
viz          0.78
excluding    0.74
Whether      0.73
Magikarp     0.71
Including    0.70
except       0.67
whether      0.65
INCLUD       0.64
BUT          0.64
Activations Density: 0.760%