INDEX
Explanations
references to colors, primarily focusing on variants of white
New Auto-Interp
Negative Logits
LError
-0.62
Leck
-0.60
Journeys
-0.59
decade
-0.59
Lend
-0.59
ScopeManager
-0.58
ويكيپيديا
-0.58
edip
-0.56
relationship
-0.56
Leap
-0.55
POSITIVE LOGITS
White
1.15
Putih
1.14
White
1.14
white
1.04
white
1.04
WHITE
1.02
WHITE
0.98
Whites
0.87
Whites
0.84
whites
0.82
Activations Density 0.117%