INDEX
Explanations
phrases related to personal relationships and interactions
New Auto-Interp
Negative Logits
Monfieur
-0.99
Chriſt
-0.94
purpoſe
-0.89
occaf
-0.88
greateſt
-0.82
ſeveral
-0.82
Majefty
-0.80
Perſ
-0.79
houſe
-0.77
Cæsar
-0.77
POSITIVE LOGITS
'):
0.78
'),
0.76
Sa
0.74
Se
0.73
'])
0.73
rospy
0.72
θα
0.72
)$_
0.72
'))
0.72
initro
0.72
Activations Density 0.041%