INDEX
Explanations
actions involving playing or interacting with others
Negative Logits
Token           Logit
DockStyle       -0.62
EconPapers      -0.61
تفصیلات         -0.59
stateProvider   -0.59
society         -0.58
neceſſ          -0.55
neceffary       -0.54
înal            -0.53
faſt            -0.53
réc             -0.53
Positive Logits
Token            Logit
tinkering        1.12
manipulate       1.05
manipulations    1.02
manipulating     1.01
experimentation  1.00
manip            0.99
tinker           0.98
manip            0.98
manipulation     0.98
manipula         0.97
Activations Density: 0.399%