INDEX
Explanations
references to abstract concepts and their relationships
Negative Logits
hår          -0.74
gills        -0.66
idUser       -0.64
SAVINGS      -0.62
resistenza   -0.61
ásban        -0.59
beiten       -0.59
веси         -0.59
indietro     -0.59
Belles       -0.59
Positive Logits
concepts     1.48
concept      1.33
Concepts     1.30
Concepts     1.27
CONCEPT      1.26
Concept      1.20
Concept      1.17
concepts     1.14
concept      1.12
CONCEPTS     1.05
Activations Density 0.041%