Explanations
phrases related to physical barriers or obstacles that impede movement or progress
Negative Logits
olate      -0.82
olar       -0.67
arse       -0.65
oy         -0.64
lat        -0.63
sonian     -0.63
iquette    -0.62
elman      -0.61
worn       -0.61
char       -0.61
Positive Logits
access          1.03
accessing       1.00
progress        0.99
entry           0.96
growth          0.88
uptake          0.87
indefinitely    0.87
Entry           0.86
deportation     0.84
flow            0.84
Activations Density: 1.608%