Explanations
references to programs, initiatives, or systems related to education, scholarships, or health
Negative Logits
ajs      -0.15
ritel    -0.15
aeda     -0.14
iren     -0.14
gros     -0.14
bbe      -0.14
endor    -0.13
raki     -0.13
onaut    -0.13
ores     -0.13
Positive Logits
及其         0.21
INCLUDING    0.19
including    0.17
including    0.17
quir         0.16
について     0.16
briefly      0.15
elay         0.15
leyin        0.15
¥            0.15
Activations Density 0.073%
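
The density figure can be read as the share of token positions on which the feature fires. The sketch below illustrates that reading under stated assumptions: the dashboard does not say how the value is computed, so the "activation greater than zero" criterion, the function name `activation_density`, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: a position counts as active when its activation is strictly
    greater than `threshold`. The source does not state its own criterion.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 7 of which are non-zero.
sample = [0.0] * 9993 + [1.2, 0.4, 3.1, 0.9, 2.2, 0.1, 5.0]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.073% would mean the feature is active on roughly 7 of every 10,000 token positions, which is typical of a sparse feature.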