INDEX
Explanations
variations of the word "give" in different contexts
New Auto-Interp
Negative Logits
ets
-0.17
iegel
-0.16
uga
-0.15
behalf
-0.15
hu
-0.14
ÑĢеп
-0.14
igel
-0.14
yx
-0.14
hus
-0.14
eya
-0.14
POSITIVE LOGITS
up
0.26
giving
0.22
give
0.21
Give
0.20
gave
0.20
Give
0.19
ousel
0.19
Giving
0.19
give
0.19
Giving
0.19
Activations Density 0.048%