INDEX
Explanations
numerical values, particularly in a context related to measurement or evaluation
New Auto-Interp
Negative Logits
Skydragon
-0.80
uyomi
-0.77
conduc
-0.76
livest
-0.74
juggling
-0.73
myster
-0.68
headlights
-0.68
puzz
-0.68
hner
-0.68
contrace
-0.67
POSITIVE LOGITS
rican
0.98
Ñĭ
0.95
ERN
0.91
ric
0.90
س
0.89
rik
0.89
ÙĪ
0.89
rique
0.87
Ñĥ
0.86
ãĥ¼
0.85
Activations Density 0.003%