INDEX
Explanations
phrases or words related to long-standing or enduring situations or phenomena
phrases related to longevity or duration
Negative Logits
chat      -0.79
kay       -0.76
WAR       -0.70
nesota    -0.69
Balt      -0.68
Stars     -0.67
lez       -0.67
teasp     -0.66
MAC       -0.65
ramid     -0.65
Positive Logits
stay          0.78
favourite     0.73
subsequ       0.70
aukee         0.70
staple        0.65
favorite      0.65
nem           0.65
Romanian      0.64
superpower    0.63
theless       0.61
Activations Density 0.041%