You have migrated posts from a WordPress site to SharePoint. However, you discover that the formatting of the page content is incorrect after migration. Typically, you see that the paragraphs are not properly separated so the SharePoint blog post looks like one large paragraph.
When Content Matrix fetches the posts from the WordPress API, they are encoded into XML formatting. This includes the page content as well. For example, below are two paragraphs from a source WordPress site.
Below is a screenshot of the post extracted from WordPress. The carriage return in XML is encoded with the highlighted text. However, this is not recognized properly as such in SharePoint.
After migration, the blog post in SharePoint will appear as:
Hi this post is published. Update to post
To resolve the issue, we need to add a simple transformer script within the Content Matrix Console. To do this, right-click on the WordPress site node and select "Edit Transformations".
We use a Regular Expression (Regex) to apply the transformation. Because the XML encoding is a simple carriage return, we can use the Regex single character escape sequence \r as the Match expression. The Value we set to replace it will be the standard HTML tag
. See the screenshot below.
© 2025 Quest Software Inc. ALL RIGHTS RESERVED. Terms of Use Privacy Cookie Preference Center