INDEX
Explanations
mention of specific locations or facilities related to daily activities
terms related to specific objects, actions, and concepts associated with daily life and societal responsibilities
New Auto-Interp
Negative Logits
hap
-0.48
actionDate
-0.46
SPI
-0.45
Amb
-0.45
Tony
-0.44
oba
-0.44
ived
-0.43
¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯
-0.43
NBC
-0.42
Adren
-0.42
POSITIVE LOGITS
ahime
0.63
destro
0.60
rul
0.59
utenberg
0.54
chuk
0.53
nodd
0.53
shenan
0.52
queue
0.51
rique
0.50
ij士
0.50
Activations Density 1.390%