INDEX
Explanations
terms denoting negative impact, especially in relation to health or politics
phrases related to short-term and long-term concepts, particularly in relation to time
Negative Logits
Rica      -0.78
Cry       -0.70
Lanka     -0.67
GEAR      -0.66
Sharma    -0.64
KN        -0.64
SOL       -0.64
physic    -0.63
Reflect   -0.61
Reviews   -0.60
Positive Logits
issue      1.04
sized      1.02
word       0.97
headed     0.97
named      0.96
division   0.96
thing      0.96
sounding   0.95
same       0.94
powered    0.93
Activations Density 0.259%