INDEX
Explanations
football players and scores
New Auto-Interp
Negative Logits
കു
0.43
arsenic
0.41
Mika
0.41
Artifact
0.40
arów
0.40
蓬
0.38
object
0.38
सें
0.38
میک
0.38
React
0.37
POSITIVE LOGITS
sir
0.42
Sutton
0.41
Lunch
0.39
Ipswich
0.39
Pozn
0.39
Sir
0.38
Missouri
0.38
neutrophiles
0.37
ంచ్
0.37
Eis
0.37
Activations Density 0.002%