INDEX
Explanations
phrases related to social interactions and empathy development
New Auto-Interp
Negative Logits
BuyableInstoreAndOnline
-0.63
ortium
-0.58
CAT
-0.56
\":
-0.55
Arri
-0.54
Shack
-0.54
Hitch
-0.53
Johns
-0.52
Journalism
-0.50
nutshell
-0.50
POSITIVE LOGITS
.''.
1.01
safely
0.93
freely
0.93
peacefully
0.82
without
0.81
efficiently
0.79
.</
0.79
securely
0.78
cheaply
0.77
whilst
0.77
Activations Density 0.371%