INDEX
Explanations
references to attention and focus
terms related to attention and attention disorders
New Auto-Interp
Negative Logits
Mehran
-0.70
halves
-0.67
Tale
-0.66
tto
-0.65
Recon
-0.64
ourke
-0.62
Yugoslavia
-0.62
Castro
-0.61
raine
-0.61
Dani
-0.60
POSITIVE LOGITS
estinal
1.00
orial
0.86
attention
0.81
ively
0.80
arios
0.77
stadt
0.76
largeDownload
0.74
cipline
0.74
spans
0.73
flow
0.73
Activations Density 0.017%