INDEX
Explanations
terms related to sudden changes or disruptions
New Auto-Interp
Negative Logits
loop
-0.80
propOrder
-0.77
loop
-0.76
loops
-0.65
Schwangerschaft
-0.64
igshid
-0.63
stationnement
-0.62
höhung
-0.62
artesanales
-0.59
esenciales
-0.58
POSITIVE LOGITS
lung
1.17
Lung
1.05
synchron
1.02
Synchron
1.01
lungs
0.97
synchronization
0.90
Lung
0.89
Sync
0.87
synchronize
0.86
synch
0.86
Activations Density 0.045%