INDEX
Explanations
historical figures and their relationships within royal lineages and nobility
New Auto-Interp
Negative Logits
Sist
-0.57
Carthag
-0.57
Италијани
-0.57
Anſ
-0.56
applau
-0.55
neceff
-0.55
liſh
-0.55
Houſe
-0.54
[@BOS@]
-0.54
<unused14>
-0.54
POSITIVE LOGITS
mourut
0.43
particuliers
0.40
siglos
0.38
poète
0.37
africain
0.35
particulières
0.35
poème
0.35
réfugiés
0.35
mismo
0.34
von
0.33
Activations Density 0.351%