INDEX
Explanations
words related to events, celebrations, or public appearances
terms related to fare in various contexts
Negative Logits
Token            Logit
olved            -0.70
compuls          -0.69
redistributed    -0.68
stocking         -0.67
omez             -0.65
transitional     -0.63
subp             -0.62
ued              -0.60
otted            -0.60
finite           -0.59
Positive Logits
Token            Logit
fare             1.95
rior             0.87
enance           0.87
ttes             0.86
Fare             0.85
ments            0.84
nce              0.81
cade             0.78
ride             0.78
riors            0.75
Activations Density: 0.012%