Action that extracts comprehensive metadata from web pages, including OpenGraph, Twitter Cards, Schema.org structured data, and standard HTML meta tags.

Example

    // Extract all metadata from a webpage
    await runAction({
      ActionName: 'URL Metadata Extractor',
      Params: [{
        Name: 'URL',
        Value: 'https://example.com/article'
      }]
    });

    // Extract specific metadata types
    await runAction({
      ActionName: 'URL Metadata Extractor',
      Params: [{
        Name: 'URL',
        Value: 'https://news.site.com/story'
      }, {
        Name: 'IncludeOpenGraph',
        Value: true
      }, {
        Name: 'IncludeSchemaOrg',
        Value: false
      }]
    });

Properties

MAX_CONTENT_SIZE: number = ...
TIMEOUT: 10000 = 10000

Methods

  • Executes metadata extraction from the specified URL

    Parameters

    • params: RunActionParams<any>

      The action parameters containing:

      • URL: Web page URL to extract metadata from (required)
      • IncludeOpenGraph: Extract OpenGraph metadata (default: true)
      • IncludeTwitterCards: Extract Twitter Card metadata (default: true)
      • IncludeSchemaOrg: Extract Schema.org JSON-LD data (default: true)
      • IncludeBasicMeta: Extract standard HTML meta tags (default: true)
      • IncludeFavicon: Extract favicon information (default: true)

    Returns Promise<ActionResultSimple>

    Comprehensive metadata extracted from the webpage
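The optional flags listed above all default to true, so a caller only needs to pass the ones it wants to turn off. The following is a minimal sketch of how such defaults could be resolved from a `Params` array. The `ActionParamSketch` interface and the `getBooleanParam` helper are hypothetical names introduced for illustration and are not part of the documented class; accepting the strings "true" and "false" is also an assumption.

```typescript
// Hypothetical shape for one entry of the Params array; the real type may differ.
interface ActionParamSketch {
  Name: string;
  Value: unknown;
}

// Look up a boolean flag by name (case-insensitive), falling back to a default
// when the parameter is absent or cannot be interpreted as a boolean.
function getBooleanParam(
  params: ActionParamSketch[],
  name: string,
  defaultValue: boolean
): boolean {
  const param = params.find(
    (p) => p.Name.trim().toLowerCase() === name.toLowerCase()
  );
  if (!param || param.Value === undefined || param.Value === null) {
    return defaultValue;
  }
  if (typeof param.Value === "boolean") {
    return param.Value;
  }
  // Assumption of this sketch: string forms "true" / "false" are accepted too.
  const text = String(param.Value).trim().toLowerCase();
  if (text === "true") return true;
  if (text === "false") return false;
  return defaultValue;
}

const params: ActionParamSketch[] = [
  { Name: "URL", Value: "https://example.com/article" },
  { Name: "IncludeSchemaOrg", Value: false },
];

// Absent from params, so the documented default (true) applies.
const includeOpenGraph = getBooleanParam(params, "IncludeOpenGraph", true);
// Present and explicitly disabled.
const includeSchemaOrg = getBooleanParam(params, "IncludeSchemaOrg", true);
```

Resolving every flag through one helper keeps the default in a single place per parameter, which makes it easier to keep the behavior in line with the documented defaults.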

  • Executes the action with the provided parameters.

    Parameters

    • params: RunActionParams<any>

      The action execution parameters including context

    Returns Promise<ActionResultSimple>

    Promise resolving to the action result

  • Extracts all types of metadata from HTML content

    Parameters

    • html: string
    • baseUrl: URL
    • includeOpenGraph: boolean
    • includeTwitterCards: boolean
    • includeSchemaOrg: boolean
    • includeBasicMeta: boolean
    • includeFavicon: boolean

    Returns Promise<any>
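As a rough illustration of what extraction from an HTML string can involve, the sketch below pulls OpenGraph and Twitter Card tags, JSON-LD blocks, and a favicon URL out of raw markup. It is not the documented implementation: `extractMetadataSketch`, `ExtractedMetadata`, `IncludeFlags`, and `getAttr` are hypothetical names, the regex-based matching is a simplification that a real HTML parser would handle more robustly, and the fallback to `/favicon.ico` is an assumed convention.

```typescript
// Hypothetical result and option shapes; not taken from the documented class.
interface ExtractedMetadata {
  openGraph: Record<string, string>;
  twitter: Record<string, string>;
  schemaOrg: unknown[];
  favicon?: string;
}

interface IncludeFlags {
  openGraph?: boolean;
  twitterCards?: boolean;
  schemaOrg?: boolean;
  favicon?: boolean;
}

// Read one quoted attribute value from a single tag's source text.
function getAttr(tag: string, name: string): string | undefined {
  const pattern = new RegExp(`\\s${name}\\s*=\\s*(?:"([^"]*)"|'([^']*)')`, "i");
  const match = pattern.exec(tag);
  return match ? (match[1] ?? match[2]) : undefined;
}

function extractMetadataSketch(
  html: string,
  baseUrl: URL,
  include: IncludeFlags = {}
): ExtractedMetadata {
  const result: ExtractedMetadata = { openGraph: {}, twitter: {}, schemaOrg: [] };

  // <meta property="og:..."> and <meta name="twitter:..."> tags.
  for (const tag of html.match(/<meta\b[^>]*>/gi) ?? []) {
    const key = getAttr(tag, "property") ?? getAttr(tag, "name");
    const content = getAttr(tag, "content");
    if (!key || content === undefined) continue;
    if (include.openGraph !== false && key.startsWith("og:")) {
      result.openGraph[key.slice("og:".length)] = content;
    }
    if (include.twitterCards !== false && key.startsWith("twitter:")) {
      result.twitter[key.slice("twitter:".length)] = content;
    }
  }

  // Schema.org data embedded as JSON-LD script blocks.
  if (include.schemaOrg !== false) {
    const ldPattern =
      /<script\b[^>]*type\s*=\s*["']application\/ld\+json["'][^>]*>([\s\S]*?)<\/script>/gi;
    let block: RegExpExecArray | null;
    while ((block = ldPattern.exec(html)) !== null) {
      try {
        result.schemaOrg.push(JSON.parse(block[1]));
      } catch {
        // Skip malformed JSON-LD rather than failing the whole extraction.
      }
    }
  }

  // Favicon: first <link> whose rel contains the token "icon", resolved against baseUrl.
  if (include.favicon !== false) {
    const links = html.match(/<link\b[^>]*>/gi) ?? [];
    const iconTag = links.find((tag) =>
      /(^|\s)icon(\s|$)/i.test(getAttr(tag, "rel") ?? "")
    );
    const href = iconTag ? getAttr(iconTag, "href") : undefined;
    result.favicon = new URL(href ?? "/favicon.ico", baseUrl).href;
  }

  return result;
}
```

Passing `baseUrl` as a `URL` object lets relative references such as `/static/fav.png` be resolved with the standard `new URL(relative, base)` constructor instead of string concatenation.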