INDEX
Explanations
phrases related to physical actions and interactions
New Auto-Interp
Negative Logits
↵↵
-0.70
-0.66
,
-0.66
L
-0.65
e
-0.64
P
-0.63
R
-0.62
heart
-0.62
-
-0.62
H
-0.60
POSITIVE LOGITS
myſelf
1.17
Efq
1.15
doubtnut
1.12
Theſe
1.10
مرئيه
1.07
BoxFit
1.01
Monfieur
1.01
berdayakan
1.01
itſelf
1.00
تانيه
0.99
Activations Density 0.438%