INDEX
Explanations
mentions of particular individuals or characters associated with specific traits or roles
New Auto-Interp
Negative Logits
iculty
-0.78
LCS
-0.73
AMY
-0.68
Compass
-0.68
reek
-0.67
REE
-0.65
riter
-0.64
Commonwealth
-0.64
Monstrous
-0.64
IRD
-0.62
POSITIVE LOGITS
aldo
0.96
atis
0.93
lich
0.88
ich
0.88
ci
0.87
zon
0.87
vana
0.85
Rin
0.85
zai
0.84
zeb
0.84
Activations Density 0.005%