INDEX
Explanations
words related to flowing water or movement
references to cascading effects or sequences of events involving specific locations and individuals
New Auto-Interp
Negative Logits
selage
-0.90
ament
-0.85
nown
-0.82
fare
-0.77
umer
-0.76
anic
-0.76
nel
-0.75
wo
-0.74
gas
-0.74
orious
-0.73
POSITIVE LOGITS
downhill
0.66
htt
0.65
downgrade
0.65
zbollah
0.63
dale
0.62
ppers
0.62
descent
0.59
TIME
0.59
dk
0.58
interstitial
0.58
Activations Density 0.044%