INDEX
Explanations
phrases that indicate capability or potential actions related to people or objects
New Auto-Interp
Negative Logits
edes
-0.16
اج
-0.16
ilion
-0.16
rob
-0.16
relude
-0.16
ennie
-0.15
Ñıв
-0.15
еÑĢалÑĮ
-0.15
तर
-0.15
.restore
-0.15
POSITIVE LOGITS
Handle
0.16
opies
0.15
ä¼ı
0.15
handle
0.14
ÏĢÎŃ
0.14
.cross
0.14
rapid
0.14
Morg
0.14
erva
0.14
Holdings
0.14
Activations Density 0.294%