INDEX
Explanations
the word "turn" followed by a number indicating a significant action or change
Negative Logits
覚醒          -1.05
ropolitan     -0.92
capacity      -0.82
mma           -0.80
inately       -0.78
ording        -0.76
foundation    -0.72
bour          -0.71
llah          -0.71
lain          -0.69
Positive Logits
awa           0.87
into          0.85
coat          0.81
shif          0.77
agra          0.76
around        0.76
inward        0.75
crank         0.75
about         0.75
sour          0.74
Activations Density: 8.651%