INDEX
Explanations
the action of giving or the concept of generosity
occurrences of the word "give" and its variations
New Auto-Interp
Negative Logits
destro
-0.67
timer
-0.67
Sphere
-0.66
alde
-0.60
Area
-0.60
ateg
-0.60
area
-0.59
antha
-0.59
andowski
-0.58
nown
-0.58
POSITIVE LOGITS
birth
1.22
chase
1.12
rise
1.08
away
1.02
speeches
0.94
up
0.93
cred
0.88
preference
0.87
generously
0.87
permission
0.86
Activations Density 0.072%