INDEX
Explanations
terms related to ideology and conceptual frameworks
New Auto-Interp
Negative Logits
ston
-0.17
abouts
-0.16
lord
-0.16
lied
-0.15
sta
-0.15
akte
-0.15
neh
-0.15
_Release
-0.15
nie
-0.14
imo
-0.14
POSITIVE LOGITS
pend
0.18
ologies
0.15
supra
0.15
ologically
0.15
entities
0.15
yll
0.15
ideal
0.15
eus
0.14
opathic
0.14
ias
0.14
Activations Density 0.010%