Explanations
Phrases that indicate growth or an increase in some phenomenon.
Negative Logits
Token      Logit
VICE       -0.72
chy        -0.66
MRI        -0.66
beit       -0.62
ãĥĩãĤ£     -0.62
cise       -0.61
ãĥĨãĤ£     -0.61
selves     -0.61
buff       -0.60
throats    -0.60
Positive Logits
Token      Logit
ivist      0.82
ancy       0.77
tide       0.77
rise       0.77
olithic    0.73
tides      0.72
xual       0.70
down       0.69
ups        0.69
anqu       0.68
Activations Density: 0.026%
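
Two entries in the negative-logit list (ãĥĩãĤ£ and ãĥĨãĤ£) look garbled. One plausible reading, which is an assumption and not something this page states, is that the tokens are shown in the printable byte-to-character encoding used by GPT-2-style byte-level BPE vocabularies. The sketch below rebuilds that byte table and inverts it; the names `bytes_to_unicode`, `UNICODE_TO_BYTE`, and `decode_token` are illustrative and do not come from the dashboard.

```python
def bytes_to_unicode():
    """Byte -> printable-character table in the style of GPT-2 byte-level BPE.

    Assumption: the dashboard's vocabulary uses this scheme.
    """
    # Bytes that are already printable map to themselves.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    offset = 0
    # Remaining bytes are shifted to code points starting at 256.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + offset)
            offset += 1
    return {b: chr(cp) for b, cp in zip(byte_values, code_points)}


# Inverse table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Map each displayed character back to its byte, then decode as UTF-8."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĥĩãĤ£", "ãĥĨãĤ£", "rise"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption the two entries decode to the katakana fragments ディ and ティ, while plain ASCII tokens such as rise are unchanged. If the underlying model uses a different tokenizer, the mapping would not apply.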