INDEX
Explanations
verbal phrases related to actions and events
New Auto-Interp
Negative Logits
ãĥĥãĥī
-0.90
encl
-0.90
BIT
-0.89
corrid
-0.83
cath
-0.82
²¾
-0.82
iHUD
-0.80
âĢ¢âĢ¢âĢ¢âĢ¢
-0.80
ushima
-0.80
gemony
-0.78
POSITIVE LOGITS
dates
1.95
stairs
1.54
dating
1.44
olean
1.26
rison
1.21
grade
1.14
rons
1.13
olicy
1.13
oons
1.11
graded
1.08
Activations Density 0.774%