INDEX
Explanations
phrases related to replacement or substitution with something new
phrases related to replacing old things with new alternatives
Negative Logits
awar      -0.71
ibrary    -0.65
Plot      -0.62
TOR       -0.61
ipl       -0.60
verbal    -0.60
warn      -0.59
banks     -0.57
Words     -0.57
tip       -0.56
Positive Logits
newer      1.37
new        1.19
simpler    1.06
softer     1.04
cleaner    1.02
healthier  0.98
safer      0.97
nicer      0.93
brighter   0.91
modern     0.90
Activations Density: 0.325%
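As a rough illustration of what a density figure like this can mean, the sketch below treats it as the fraction of token positions on which the feature's activation is above zero. That definition, the function name activation_density, the threshold parameter, and the sample data are all assumptions made for illustration; the page itself does not say how the number was computed.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumed definition for illustration only; not taken from the source page.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 13 of 4000 token positions.
sample = [0.0] * 3987 + [1.5] * 13
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.325% would mean the feature is active on roughly 1 token in 300, which fits a feature that responds to a narrow family of "newer, better replacement" words.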