INDEX
Explanations
place names and identifiers for organizations and groups
British institutions and titles
New Auto-Interp
Negative Logits
SharedCtor
-0.34
wat
-0.31
spot
-0.28
kona
-0.28
kar
-0.26
Fre
-0.26
nezeu
-0.26
disambiguazione
-0.25
Sea
-0.25
Car
-0.25
POSITIVE LOGITS
AssemblyTitle
0.70
⟬
0.64
ptonshire
0.63
ModelExpression
0.62
0.59
كومونز
0.58
ondra
0.57
Sunak
0.55
classnames
0.54
hassee
0.54
Activations Density 0.266%