• summary: extract keywords from doc --> extract keywords from file