Add a web data sourceManage Springboard data sources
This topic details how to add a web data source associated with a Springboard application.
-
In the Applications Manager screen, select the application to add the new data source, and then select the new browser tab the system opens.
-
If you are adding:
-
The initial data source for the application:
-
In the application’s Hub screen, click Configure.
-
In the Welcome screen, click Get Started.
-
In the Add external content source screen, click the Web data source type.
-
-
An additional web data source:
-
In the application’s Hub screen, click the Data Sources icon in the left panel, or scroll to the Data Sources section of the screen and click Manage in the top right corner of the section.
-
In the Data Sources screen, click +New in the top right corner.
-
In the Add data source screen, click Web.
-
-
-
In the Data source name field, enter a short name. For example, Company_FAQ.
-
In the Start URL field, enter the initial location to begin crawling for the application. The URL can be a site, a sitemap, or a sitemap index.
-
In the Labels field, enter optional values to identify the data source. For example, FAQ.
-
In the Region field, select the region to ingest your data source’s data in.
Region selection is permanent. -
Click Continue.
-
In the Include pages field, select one of the following options to specify which site pages to crawl:
-
Pages under the start URL
-
Pages on this site and its subdomains
-
-
In the Include file types field, select all applicable file types:
-
Slide
-
PDF
-
Spreadsheet
-
Word
For more information, see Web data source file types and File extension processing.
-
-
In the Include external domains for selected file types field, enter external domains that contain the selected file types you want to include in the crawl and press Enter. The format for the entry is
example.com
, nothttps://example.com
. Do not usehttps:
in the format for the external domain. Subdomains are automatically included, unless added to the list of exclude links.The URL entered displays under the field and the field clears to add another URL. To remove a URL, click the X to the right of the entry.
Other fields affect whether files from URLs entered are included in the crawl. For more information, see Requirements for files to be included in the crawl. -
In the Include meta tags field, enter a metadata tag to include during ingestion and press Enter. The metadata tag entered displays under the field and the field clears to add another metadata tag. To remove a metadata tag, click the X to the right of the entry. If the metadata tags you enter exist and contain values, they are ingested in the crawl.
-
In the Include query parameters field, enter a query parameter and then click the + sign or press Enter. The parameter entered displays under the field and the field clears to add another parameter. All of the parameters entered combine to identify a unique web page. For more information, see Include query parameters functionality details.
-
In the Include links field, add full or partial URLs that you wish to include in the crawl, and then click the + sign.
-
In the Exclude links field, add full or partial URLs to exclude from the web crawl, and then click the + sign.
-
In the Data ingest run scheduling field, create a schedule to specify when the data ingestion automatically runs. By default, the data source schedule is set to Monthly with the date and time based on your browser time when adding the data source.
To initiate an on-demand data ingestion outside the regular schedule, follow the process to run on-demand data ingestion. -
In the Limit crawl levels field, click and drag the scale to the number of levels to crawl.
-
To save the data source information, click Save & Run.
-
If you are adding:
-
The initial data source for the application, the Summary screen displays a message that the data source has been created. Click Finish to close the wizard.
-
An additional web data source, the new data source displays in the Data Sources page.
-
-
In the Data Sources page, verify the new data source is listed with its current status.
The data source may take several minutes to be created and begin crawling the specified URL.
Additional information
-
For conceptual information, see Data sources.
-
For information about the user interface screen, see Data sources screen.