INDEX
Explanations
words related to turning or events that involve a change or twist
references to turns or directional changes
Negative Logits
Token       Logit
ropolitan   -0.75
inately     -0.70
è¦ļéĨĴ      -0.67
aimon       -0.66
anyahu      -0.65
capacity    -0.64
Mellon      -0.63
gdala       -0.62
llah        -0.62
franc       -0.62
Positive Logits
Token       Logit
buck        0.96
about       0.95
around      0.85
abouts      0.84
coat        0.82
bull        0.80
overs       0.79
ips         0.79
wheel       0.78
over        0.77
Activations Density 0.035%
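
For readers unfamiliar with the density figure, here is a minimal sketch of how such a number could be computed. It assumes that density means the fraction of sampled token positions on which the feature's activation is above zero; the source does not state the definition, so that is an assumption. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of sampled tokens on which the
    feature fires at all. The dashboard itself does not define the term.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 7 firing positions among 20,000 sampled tokens.
sample = [0.0] * 19_993 + [1.2, 0.4, 3.1, 0.9, 2.2, 0.1, 5.0]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.035% corresponds to the feature firing on roughly 3 or 4 tokens per 10,000 sampled.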