INDEX
Explanations
references to "first time" experiences or events
first time experiences
New Auto-Interp
Negative Logits
betrek
-0.49
henvis
-0.44
attutto
-0.43
escenas
-0.43
gruesa
-0.43
ジェクト
-0.42
strijd
-0.42
invloed
-0.42
voedsel
-0.42
serpiente
-0.41
POSITIVE LOGITS
مشين
0.70
!*\
0.60
OGND
0.59
########.
0.59
NameInMap
0.59
***!
0.56
autorytatywna
0.55
Personensuche
0.53
第一次
0.52
GEBURTS
0.51
Activations Density 0.005%