INDEX
Explanations
information related to current political events and government policies
New Auto-Interp
Negative Logits
heit
-0.67
CONCLUS
-0.64
Cyp
-0.64
quant
-0.60
Tuc
-0.58
assum
-0.56
advertisement
-0.56
peak
-0.56
heimer
-0.55
ãĥ¼ãĥĨãĤ£
-0.53
POSITIVE LOGITS
everal
0.82
Sorry
0.80
Latest
0.73
ccording
0.72
Actor
0.70
outher
0.70
Former
0.68
Thousands
0.66
Scientists
0.65
Amid
0.63
Activations Density 0.043%