INDEX
Explanations
phrases related to categories and classifications of items or concepts
New Auto-Interp
Negative Logits
Types
-0.44
Typen
-0.44
Types
-0.39
telle
-0.37
TYPES
-0.36
Typical
-0.36
типи
-0.36
telles
-0.35
subtypes
-0.33
khas
-0.32
POSITIVE LOGITS
kind
1.95
kind
1.52
sort
1.41
KIND
1.31
Kind
1.27
Kind
1.25
KIND
1.19
kinda
1.17
sort
1.07
sorta
1.07
Activations Density 0.223%