INDEX
Explanations
words related to changes such as declines, falls, rises, and drops in various contexts
terms related to decreases or declines in various contexts
New Auto-Interp
Negative Logits
extras
-0.65
vandal
-0.64
rich
-0.63
Exile
-0.62
flix
-0.60
racuse
-0.58
quir
-0.57
exile
-0.56
subtitles
-0.56
Voc
-0.55
POSITIVE LOGITS
rate
0.93
ait
0.86
occurring
0.84
rates
0.83
wrought
0.81
lust
0.81
luster
0.78
ghai
0.78
coincided
0.76
between
0.75
Activations Density 0.197%