INDEX
Explanations
instances of items being categorized into different types or groups
structures and classifications involving numbers and categories
Negative Logits
vez       -0.74
amaru     -0.72
brance    -0.72
hire      -0.69
reb       -0.67
potion    -0.67
hers      -0.66
encer     -0.64
$$        -0.64
ifice     -0.64
Positive Logits
kinds        1.25
types        1.24
distinct     1.18
phases       1.18
tiers        1.06
main         1.06
categories   1.04
stages       1.02
different    1.01
ways         1.00
Activations Density 0.157%