Module Assessment | Yext Hitchhikers Platform

loading
You must include a first name and last name in your profile to create challenge environments.

Background

The Turtlehead Tacos team maintains a separate Help Site for people asking questions around online ordering or using the mobile app. They have a completely separate content management process for that so you don’t want to disrupt that, but you know that eventually you’ll want to make sure you can surface answers to these support questions in your Search experience. To prepare for that, you’ll set up a crawler for those help article pages so you can start to ingest the content. Later on, you can worry about adding this to your Search experience.

The help articles you’ll be working with are on help.turtleheadtacos.com.

Your Challenge

  1. Navigate to Content > Configuration > Crawlers.

  2. Click + New Crawler.

  3. Fill out the crawler settings:

    • Name: “Help Articles”
    • Schedule: “Once” – for now you’re just going to do a one-time import, but later you may update this to do it daily or weekly.
    • Crawl Strategy: Sub-Pages – you only want to crawl this part of the site.
    • Domain to Crawl: https://help.turtleheadtacos.commake sure not to add a trailing “/” on the URL
  4. Click “Save Crawler”. Congrats, you’ve created a Crawler!

  5. It may take a few moments but you should see a Crawl in Progress. It should be complete once you see that 6 pages have been crawled. Wait until it completes (you might need to refresh the page). If you run into any errors, please reach out in the Community.

  6. Now that you’ve set up the Crawler and extracted the HTML off the page, you can set up a Data Connector using the Crawler as a Source. Before you do that, we need to enable the Help Article entity type. Navigate to Content > Configuration > Entity Types and enable Help Article.

  7. Navigate to Content > Connectors.

  8. Click + Add a Connector.

  9. Select Site Crawler.

  10. Set your Crawler Extraction Settings and then click “Continue”.

    • Crawler: Help Articles (the Crawler you set up in Steps 2-4)
    • URLs: Select Specific URLs or URL Patterns and enter https://help.turtleheadtacos.com/* – this will pull all of the help articles but not the homepage
  11. Select the Page Type as Detail page - each of the help articles is on its own page.

  12. Click “Add Default Selectors”. You’ll see the Page ID, Page URL and Page Title pulled. Click Add Selector at the top to add a couple more selectors and then click “Continue”.

    • Add a selector with Header of “Body” and use the “Cleaned Body Content” as your specified Path. This will pull in the help body itself.

    • Add a selector using CSS Selector with Header of “Tags”. You want to pull in the tags at the bottom of the help articles like this one . Inspect the page and/or use https://try.jsoup.org/ to try to find the right CSS Selector to pull the list in. To modify your selector, remember that you can hover over the Tags column and click on the pencil icon to edit the selector and try again. If you can’t figure it out, you can find the CSS Selector in this gist .

  13. Select Help Article as the Entity Type.

  14. Now it’s time to map the selectors to fields on your Help Article Entity Type. You’ll see the Selectors from the last step in the first Column with a preview of the data in the second column. Update the “Map to Field” column with the following and then click “Save”.

    • Page ID -> Entity ID
    • Page URL -> Landing Page URL
    • Page Title -> Name
    • Body -> Body with subfield Markdown
    • Tags -> Keywords mapped to an entire list with “,” as the Delimiter
  15. Click Save at the bottom of the page. You will be prompted to enter a Name and ID. Enter the following:

    • Name: Help Articles
    • ID: helpArticles
  16. Click the “Save & Run Now” to pull the entities into your account. Run in Default Mode.

  17. Monitor your run to make sure it is successful. You should see 5 successful adds! Click Content > Entities to see your new “Help Article” entities.

Module Assessment
+<% util.points %> points
loading
Weekly Challenge Streak Weekly Challenge Streak: <% util.streakWeekly %>
You must include a first name and last name in your profile to create challenge environments.
Challenge account expires in <% util.expirationHours > 24 ? Math.round((util.expirationHours * 1.0) / 24) : util.expirationHours %> <% util.expirationHours > 24 ? (Math.round((util.expirationHours * 1.0) / 24) == 1 ? 'day' : 'days') : (util.expirationHours == 1 ? 'hour' : 'hours') %>.
Challenge account has expired. Please create a new account.
Report Card
Add a crawler for help.turtleheadtacos.com
Add a data connector for help articles built on top of your web crawler
Run your connector to pull in the help articles
Enable the Help Article entity type
Previous Submissions
Attempt #<% submission.attemptNumber %>
<% submission.date %>
Score: <% submission.numericScore %>
Pending