INDEX
Explanations
descriptive words or phrases indicating a strong sense of need or urgency
expressions of urgent need or desperate situations
New Auto-Interp
Negative Logits
ences
-0.85
essee
-0.80
insula
-0.80
animous
-0.72
arthed
-0.71
ence
-0.71
arth
-0.66
citation
-0.66
encer
-0.66
ittal
-0.65
POSITIVE LOGITS
ãĥ£
0.87
needed
0.81
patched
0.75
pleas
0.75
ãĤ§
0.73
needed
0.73
phans
0.71
ache
0.71
fought
0.70
ãĤ©
0.70
Activations Density 0.011%