INDEX
Explanations
words and phrases related to media production and evaluation processes
New Auto-Interp
Negative Logits
flashback
-0.13
Ways
-0.13
victim
-0.13
лÑıв
-0.13
overall
-0.13
ilet
-0.13
elites
-0.13
premises
-0.13
allon
-0.12
.Constants
-0.12
POSITIVE LOGITS
aspect
0.40
option
0.35
feature
0.34
section
0.34
portion
0.33
category
0.31
phenomenon
0.29
clause
0.28
aspect
0.28
mechanism
0.28
Activations Density 0.011%