INDEX
Explanations
phrases related to institutions or structured systems
terms and phrases related to employment and work conditions
Negative Logits
elcome         -0.78
Nusra          -0.74
ÄŁ             -0.73
conn           -0.72
worthiness     -0.72
veyard         -0.69
agen           -0.68
dam            -0.67
ippery         -0.67
hap            -0.66
Positive Logits
referen        0.77
entertainment  0.76
olds           0.73
IDE            0.69
guest          0.67
event          0.66
gorilla        0.65
approach       0.65
educational    0.64
enterprise     0.64
Activations Density: 0.372%