| tm.plugin.webmining-package | Retrieve structured, textual data from various web sources |
| assignValues | Extract Main HTML Content from DOM |
| calcDensity | Extract Main HTML Content from DOM |
| corpus.update | Update/Extend 'WebCorpus' with new feed items. |
| corpus.update.WebCorpus | Update/Extend 'WebCorpus' with new feed items. |
| encloseHTML | Enclose Text Content in HTML tags |
| encloseHTML.character | Enclose Text Content in HTML tags |
| encloseHTML.PlainTextDocument | Enclose Text Content in HTML tags |
| extract | Extract main content from 'TextDocument's. |
| extract.PlainTextDocument | Extract main content from 'TextDocument's. |
| extractContentDOM | Extract Main HTML Content from DOM |
| extractHTMLStrip | Simply strip HTML Tags from Document |
| feedquery | Build up the string for a feed query. |
| getEmpty | Retrieve Empty Corpus Elements through '$postFUN'. |
| getEmpty.WebCorpus | Retrieve Empty Corpus Elements through '$postFUN'. |
| getLinkContent | Get main content for corpus items, specified by links. |
| getMainText | Extract Main HTML Content from DOM |
| GoogleFinanceSource | Get feed metadata from Google Finance. |
| GoogleNewsSource | Get feed data from Google News Search (<URL: http://news.google.com/>). |
| json_content | Read content from WebXMLSource/WebHTMLSource/WebJSONSource. |
| NYTimesSource | Get feed data from NYTimes Article Search (<URL: http://developer.nytimes.com/docs/read/article_search_api_v2>). |
| nytimes_appid | AppID for the NYTimes API. |
| parse | Wrapper/convenience function to ensure the right encoding on different platforms. |
| readGoogle | Get feed metadata from Google Finance. |
| readNYTimes | Get feed data from NYTimes Article Search (<URL: http://developer.nytimes.com/docs/read/article_search_api_v2>). |
| readReutersNews | Get feed data from Reuters News RSS feed channels. Reuters provides numerous feed |
| readWeb | Read content from WebXMLSource/WebHTMLSource/WebJSONSource. |
| readWebHTML | Read content from WebXMLSource/WebHTMLSource/WebJSONSource. |
| readWebJSON | Read content from WebXMLSource/WebHTMLSource/WebJSONSource. |
| readWebXML | Read content from WebXMLSource/WebHTMLSource/WebJSONSource. |
| readYahoo | Get feed data from Yahoo! Finance. |
| readYahooHTML | Get news data from Yahoo! News (<URL: https://news.search.yahoo.com/search/>). |
| readYahooInplay | Get News from Yahoo Inplay. |
| removeNonASCII | Remove non-ASCII characters from Text. |
| removeNonASCII.PlainTextDocument | Remove non-ASCII characters from Text. |
| removeTags | Extract Main HTML Content from DOM |
| ReutersNewsSource | Get feed data from Reuters News RSS feed channels. Reuters provides numerous feed |
| source.update | Update WebXMLSource/WebHTMLSource/WebJSONSource |
| source.update.WebHTMLSource | Update WebXMLSource/WebHTMLSource/WebJSONSource |
| source.update.WebJSONSource | Update WebXMLSource/WebHTMLSource/WebJSONSource |
| source.update.WebXMLSource | Update WebXMLSource/WebHTMLSource/WebJSONSource |
| tm.plugin.webmining | Retrieve structured, textual data from various web sources |
| trimWhiteSpaces | Trim White Spaces from Text Document. |
| WebCorpus | WebCorpus constructor function. |
| webmining | Retrieve structured, textual data from various web sources |
| WebSource | Read Web Content and respective Link Content from feedurls. |
| YahooFinanceSource | Get feed data from Yahoo! Finance. |
| YahooInplaySource | Get News from Yahoo Inplay. |
| yahoonews | WebCorpus retrieved from Yahoo! News for the search term "Microsoft" through the YahooNewsSource. Length of retrieved corpus is 20. |
| YahooNewsSource | Get news data from Yahoo! News (<URL: https://news.search.yahoo.com/search/>). |