INDEX
Explanations
terms indicating a lack of awareness or understanding
New Auto-Interp
Negative Logits
صوتيه
-0.86
་་
-0.76
Majefty
-0.74
^(@)
-0.71
expandindo
-0.70
staande
-0.67
-0.66
―――――
-0.66
deſt
-0.65
Anſ
-0.64
POSITIVE LOGITS
]]
0.98
blind
0.80
Blind
0.69
+"
0.67
blind
0.59
度
0.56
Blind
0.55
</sub>
0.55
blindness
0.53
]],
0.52
Activations Density 0.108%