Get content of HTML-tags

I need to search large html-files and find all texts between and </html tag> tags.

text test text1</html tag>
text test text2</html tag>
text test text3</html tag>

–>{“test text1”, “test text2”, “test text3”}

Fastest way would be using grep or something similar?