INDEX
Explanations
phrases describing specific attributes or features
quantities and measurements related to objects and their properties
Negative Logits
projects     -0.89
OWS          -0.77
Train        -0.76
onse         -0.76
sports       -0.75
cakes        -0.75
bis          -0.75
Products     -0.71
Music        -0.71
arten        -0.70
Positive Logits
tendency     1.25
lifespan     1.19
diameter     1.09
expiration   1.07
drawback     1.04
propensity   1.02
resemblance  1.02
radius       0.97
capacity     0.96
reputation   0.95
Activations Density: 0.207%