Azure-docs: Character limits for key phrase extraction

Created on 25 Jan 2019  Â·  3Comments  Â·  Source: MicrosoftDocs/azure-docs

I am using service level S1 indexing an Excel spreadsheet in blob storage and get the following error:
"message": "Skill #2: Document was truncated to 50000 characters before KeyPhrase extractionrnSkill #1: An error occurred when attempting to obtain entities from portions of input textrn"

Is the truncation to 50,000 characters a limit just for key phrase extraction? I'm assuming this is different that the 4 million character limit for documents in blob storage that is listed on this documentation page. Is the character limit for key phrase extraction different for different service levels? Is this documented?


Document Details

⚠ Do not edit this section. It is required for docs.microsoft.com ➟ GitHub issue linking.

cxp product-question searcsvc triaged

All 3 comments

@alathrop Thank you for this detailed feedback. We are investigating and will update you when we have accurate information to respond with.

Hi @alathrop, it's a limit on the key phrase extraction skill: https://docs.microsoft.com/en-us/azure/search/cognitive-search-skill-keyphrases#data-limits

Cognitive skills limits are not documented in service limits, but I'll take an action item to include a section that summarizes them.

@alathrop, We will close this out, but if you feel you need more information please just let us know.
@HeidiSteen, Thank you!

Was this page helpful?
0 / 5 - 0 ratings