Skip to main content

Manipulating the DOM

Now that you have learned the basics of using Cheerio and have gained some experience with loading and traversing documents, it's time to dive deeper into manipulating elements. Whether you want to modify element attributes and properties, add and remove classes, modify text and HTML content, or insert and remove elements, Cheerio provides a range of methods to help you do so.

In this guide, we will focus specifically on manipulating elements within a document using Cheerio. We will cover methods for modifying element attributes and properties, adding and removing classes, modifying text and HTML content, inserting and removing elements, and handling errors and debugging. By the end of this guide, you will have a good understanding of how to use these methods to manipulate elements within a document using Cheerio.

Modifying Element Attributes and Properties

To modify the attributes and properties of a single element, you can use the attr() and prop() methods, respectively. Both methods take a key and a value as arguments, and allow you to get and set the attribute or property. When setting, they apply to all elements in the selection; when getting, they return a single value corresponding to the first element in the selection.

// Set the 'src' attribute of an image element
$('img').attr('src', 'https://example.com/image.jpg');

// Set the 'checked' property of a checkbox element
$('input[type="checkbox"]').prop('checked', true);

// Get the 'href' attribute of a link element
const href = $('a').attr('href');

// Get the 'disabled' property of a button element
const isDisabled = $('button').prop('disabled');

prop() is not limited to simple values like strings and booleans. You can also use it to get complex properties like the style object, or to resolve href or src URLs of supported elements. You can also use it to get the tagName, innerHTML, outerHTML, textContent, and innerText properties of a single element.

// Get the `style` object of an element
const style = $('div').prop('style');

// Get the resolved `src` URL of an image element
$('img').prop('src');

// Get the outerHTML of an element
const outerHTML = $('div').prop('outerHTML');

// Get the innerText of an element
const innerText = $('div').prop('innerText');

Adding and Removing Classes

To add or remove classes from an element, you can use the addClass(), removeClass(), and toggleClass() methods. All three methods take a class name or a space-separated list of class names as an argument. They modify all elements in the selection.

// Add a class to an element
$('div').addClass('new-class');

// Add multiple classes to an element
$('div').addClass('new-class another-class');

// Remove a class from an element
$('div').removeClass('old-class');

// Remove multiple classes from an element
$('div').removeClass('old-class another-class');

// Toggle a class on an element (add if it doesn't exist, remove if it does)
$('div').toggleClass('active');

Modifying the Text Content of an Element

To query or modify the text content of an element, you can use the text() method. Given a string as an argument, it sets the text content of every element in the selection to the given string. Without arguments, it returns the text content of every element (including its descendants) in the selection, concatenated together.

// Set the text content of an element
$('h1').text('Hello, World!');

// Get the text content of an element
const text = $('p').text();
Note

text() returns the textContent of all passed elements. The result will include the contents of <script> and <style> elements. To avoid this, use .prop('innerText') instead.

Modifying the HTML Content of an Element

To query or modify the HTML content of an element, you can use the html() method. Given an HTML string as an argument, it sets the inner HTML of every element in the selection to the given string. Without arguments, it returns the inner HTML of the first element in the selection.

// Set the inner HTML of an element
$('div').html('<p>Hello, World!</p>');

// Get the inner HTML of an element
const html = $('div').html();

Inserting New Elements

To insert new elements into a document, you can use the append(), prepend(), before(), and after() methods. These modify every element in the selection.

// Append an element to the end of a parent element
$('ul').append('<li>Item</li>');

// Prepend an element to the beginning of a parent element
$('ul').prepend('<li>Item</li>');

// Insert an element before a target element
$('li').before('<li>Item</li>');

// Insert an element after a target element
$('li').after('<li>Item</li>');

The insertAfter() and insertBefore() methods allow you to insert an element before or after a target element, respectively. Both methods take a string or a Cheerio object as an argument and insert the given element before or after the target element.

const $ = require('cheerio');

// Insert an element after a target element
$('<p>Inserted element</p>').insertAfter('h1');

// Insert an element before a target element
$('<p>Inserted element</p>').insertBefore('h1');

The prependTo() and appendTo() methods allow you to prepend or append an element to a parent element, respectively. Both methods take a string or a Cheerio object as an argument and insert the element at the beginning or end of the given parent element.

const $ = require('cheerio');

// Prepend an element to a parent element
$('<p>Inserted element</p>').prependTo('div');

// Append an element to a parent element
$('<p>Inserted element</p>').appendTo('div');

Wrapping and Unwrapping Elements

Sometimes you may want to wrap an element in another element, or remove the element's parent element while keeping its children. To do this, you can use the wrap(), wrapInner(), and unwrap() methods.

The wrap() method takes a string or a Cheerio object as an argument and wraps the element in the given element.

// Wrap an element in a div
$('p').wrap('<div></div>');

The wrapInner() method works similar to wrap(), but instead of wrapping the element itself, it wraps the element's inner HTML in the given element.

// Wrap the inner HTML of an element in a div
$('div').wrapInner('<div></div>');

The unwrap() method removes the element's parent element, while keeping the element and its children.

// Unwrap an element
$('p').unwrap();

Replacing Elements

To replace an element with another element, you can use the replaceWith() method. It takes a string or a Cheerio object as an argument and replaces each element in the selection with the given element.

// Replace an element with another element
$('li').replaceWith('<li>Item</li>');

Note that the replaceWith() method removes the element from the document and replaces it with the given element or HTML string. If you want to keep the element and modify its contents, you can use the html() or text() methods instead.

Removing Elements

To remove an element from a document, you can use the remove() method. It removes each element in the selection, and all of their children, from the document.

// Remove an element from the document
$('li').remove();

Alternatively, you can remove the children of an element from the document, without removing the element itself, using the empty() method. It removes the children (but not text nodes or comments) of each element in the selection from the document.

// Remove an element's children from the document
$('li').empty();

Conclusion

We learned how to manipulate elements within a document using Cheerio. We can modify element attributes and properties, add and remove classes, modify text and HTML content, insert and remove elements, and handle errors and debug our code. With these tools, you can easily manipulate a document to suit your needs.