INDEX
Explanations
phrases related to balance and variety
Negative Logits
olas            -0.16
onis            -0.15
vault           -0.15
ksen            -0.15
sted            -0.15
ρου             -0.14
.localization   -0.14
oleč            -0.14
ceil            -0.13
.ceil           -0.13
Positive Logits
intermediate    0.78
Intermediate    0.67
Intermediate    0.64
intermediary    0.62
middle          0.60
intermedi       0.59
middle          0.50
intervening     0.50
between         0.49
中              0.47
Activations Density 0.200%