INDEX
Explanations
phrases related to politics and constituents
New Auto-Interp
Negative Logits
SourceFile
-0.57
Released
-0.56
MpServer
-0.55
Orchestra
-0.53
confir
-0.53
acas
-0.51
yssey
-0.51
Lua
-0.50
las
-0.50
itled
-0.48
POSITIVE LOGITS
rather
0.89
anyway
0.88
wherever
0.86
everywhere
0.85
anyways
0.79
insofar
0.77
oneself
0.75
blindly
0.73
endlessly
0.72
inherently
0.72
Activations Density 1.031%