INDEX
Explanations
proper nouns related to specific cultural or regional references
words related to the French language and its dialects
Negative Logits
bott      -0.75
RELEASE   -0.74
Doug      -0.72
Clapper   -0.72
Bung      -0.71
Geoff     -0.70
/-        -0.70
cliffe    -0.70
Hig       -0.69
Pip       -0.69
Positive Logits
ia        1.20
ian       1.14
ians      1.10
iosis     1.03
idian     1.01
ese       1.00
iac       0.97
ium       0.95
iah       0.94
ien       0.92
Activations Density 0.216%