INDEX
Explanations
phrases related to causing an effect or change
phrases indicating the act of bringing or causing change
New Auto-Interp
Negative Logits
schild
-0.70
livious
-0.69
debian
-0.67
uary
-0.67
dating
-0.67
raid
-0.65
codes
-0.64
mast
-0.63
living
-0.62
ocene
-0.61
POSITIVE LOGITS
forth
1.17
together
0.88
endum
0.78
forward
0.77
attention
0.77
up
0.75
EMENT
0.75
misfortune
0.70
smiles
0.69
hurst
0.69
Activations Density 0.036%