INDEX
Explanations
phrases related to change or progression
phrases related to the emergence or development of concepts and phenomena
New Auto-Interp
Negative Logits
Ukrain
-0.66
bor
-0.61
Dynam
-0.59
Crus
-0.58
ailability
-0.58
Rabbit
-0.57
Haw
-0.57
POW
-0.56
fix
-0.56
akedown
-0.56
POSITIVE LOGITS
actionDate
0.83
ocument
0.80
ctuary
0.72
thereto
0.70
ENCE
0.68
encia
0.68
thood
0.68
ounter
0.67
rils
0.66
heit
0.66
Activations Density 0.077%