INDEX
Explanations
units, punctuation, technical terms
New Auto-Interp
Negative Logits
obiology
0.98
residency
0.94
managerial
0.93
ultrast
0.92
syntactic
0.91
づくり
0.90
algorithmic
0.90
cryptography
0.89
physiology
0.88
countermeasures
0.86
POSITIVE LOGITS
گاه
1.07
사항
0.99
Punkte
0.91
пункт
0.90
گاه
0.83
items
0.82
пунктов
0.80
пункта
0.78
物が
0.77
Punkten
0.77
Activations Density 0.582%