INDEX
Explanations
emotional states and themes of personal introspection
New Auto-Interp
Negative Logits
lied
-0.17
.ibatis
-0.16
loff
-0.15
,readonly
-0.15
uled
-0.15
Ùħبر
-0.15
åIJī
-0.14
ëĦIJ
-0.14
osate
-0.14
icana
-0.14
POSITIVE LOGITS
ness
0.42
fulness
0.41
iness
0.40
iveness
0.38
ality
0.36
liness
0.36
lessness
0.35
ity
0.32
ism
0.31
NESS
0.30
Activations Density 0.203%