INDEX
Explanations
phrases related to the concept of change
New Auto-Interp
Negative Logits
amina
-0.84
ngth
-0.68
mination
-0.67
ç«
-0.67
çĦ
-0.65
APH
-0.63
vern
-0.62
Whale
-0.61
ross
-0.61
Ys
-0.61
POSITIVE LOGITS
drastically
0.92
radically
0.87
tack
0.85
dramatically
0.82
gears
0.80
iating
0.79
perceptions
0.75
diapers
0.75
nil
0.75
atile
0.74
Activations Density 0.531%