INDEX
Explanations
phrases related to reduction or decrease in size or quantity
terms related to reduction or decrease in size
New Auto-Interp
Negative Logits
POL
-0.80
raid
-0.74
":[{"-0.63
PLIC
-0.63
cause
-0.62
shows
-0.62
role
-0.61
iqu
-0.61
Pa
-0.61
Drama
-0.61
POSITIVE LOGITS
glers
0.89
ular
0.81
violet
0.80
downs
0.79
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
0.76
shrink
0.75
lasses
0.74
eness
0.74
shrinking
0.74
abies
0.74
Activations Density 0.042%