INDEX
Explanations
specific entities or names, potentially related to events or locations
references to specific organizations or named entities
Negative Logits
luaj        -0.87
ãĥ¯ãĥ³      -0.86
hole        -0.83
rote        -0.81
ahime       -0.81
intosh      -0.78
estern      -0.78
RAFT        -0.78
holes       -0.77
vernment    -0.76
Positive Logits
Goods       0.76
SEAL        0.65
OPLE        0.64
poppy       0.64
born        0.63
Sons        0.62
Malfoy      0.62
Nieto       0.62
Debt        0.62
beam        0.61
Activations Density: 0.030%