Snipping HTML using PHP

I

InstilledBee

Guest
So I'm making a website for a friend, and wants a custom blog to go along with it. I'm cool with writing the code for the blog and all, but I ran into a little issue with regarding to snipping the text. (to display it in a sort-of home portal)

Basically I need something that can snip the blog text into a certain length, like a summary of the blogpost of sorts, that would fit on the portal home page. But since the post would have HTML enabled and the HTML would be stored in the DB, snipping it may cut off part of the HTML and not display properly. So, it needs to be split into certain tokens and then output a certain amount of tokens.

I kind of get the logic and idea, but I don't know how to put it into code, basically. Er, programmer's block, perhaps? :p Any help would be appreciated. :)
 

UndeadDragon

Super Moderator
Reaction score
447
When displaying the text you could use PHP's substring:

PHP:
<body>
<?php 
$text = mysql_query("SELECT text FROM table");
$shortText = substr($text, 0, 30);
echo($shortText);
?>
</body>
 

Ghan

Administrator - Servers are fun
Staff member
Reaction score
888
I know you can use regular expressions to parse out the correct data while making sure that HTML tags are closed properly. Unfortunately I'm not versed enough in regexes in order to actually help with any code. :p
 
I

InstilledBee

Guest
When displaying the text you could use PHP's substring:

PHP:
<body>
<?php 
$text = mysql_query("SELECT text FROM table");
$shortText = substr($text, 0, 30);
echo($shortText);
?>
</body>

Hmm. Yeah I already got that part and am using substr(), but I still need code for making sure the HTML tags are properly parsed and do not get snipped halfway through the tag or something. But thanks! :D

I know you can use regular expressions to parse out the correct data while making sure that HTML tags are closed properly. Unfortunately I'm not versed enough in regexes in order to actually help with any code. :p

Well, I will try reading a thing or two on regexps. Thanks for the suggestion! :D
 

UndeadDragon

Super Moderator
Reaction score
447
Sorry, I didn't think of that part of it :p

Are there tags inside the actual passage of text, or do they just surround it?
 
I

InstilledBee

Guest
Hmm. What is originally planned is that the author can insert his own HTML tags as he writes the blogpost. For listing the posts, it is outputted on a <p> and trimmed if the length is greater than n, like so:

PHP:
	if($posts == 0) {echo '<h2>No posts yet.</h2>';}
	else {
		for($i = 0; $i < count($posts); $i++) {
			if(!$summary) {echo '<div class="pbody">';}
			echo '<h1><a href="index.php?page=blog&pid=', $i + 1, '">', $posts[$i]['title'], '</a></h1>';
			if(strlen($posts[$i]['message']) < $lim) {echo '<p>', $posts[$i]['message'], '</p>';}
			else {echo '<p>', substr($posts[$i]['message'], 0, $lim), '... <a href="index.php?page=blog&pid=', $i + 1, '">(Read more)</a></p>';}
			echo '<p><strong><em>Posted by ', $posts[$i]['author'], ' on ', $posts[$i]['time'], '</em></strong></p>';
			if(!$summary) {echo '</div>';}
		}
	}

(It's not the best code, but it gets the job done. :D Sorry if it looks, er, unorganized :eek:)
 

UndeadDragon

Super Moderator
Reaction score
447
With some playing around, I could work out if a set of tags was detected, and you can compare the before and after, however I can't work out out how to close an already opened tag... yet :p

PHP:
<?php
function numberOfTags( $html ) {
  preg_match_all("/(<([\w]+)[^>]*>)(.*?)(<\/\\2>)/", $html, $matches, PREG_SET_ORDER);
  
  return count($matches);
}

$string = "<b>Testing whether the tags are still here</b>";
$tagsBefore = numberOfTags($string);
$short = substr($string, 0, 10);
$tagsAfter = numberOfTags($short);

echo($string . ": " . $tagsBefore . " sets of tags<br />");
echo($short . ": " . $tagsAfter . " sets of tags<br />");
echo("<br />");

if($tagsBefore == $tagsAfter) echo("Success");
else echo("Tags do not match");
?>

http://labs.omega-designs.com/shortened.php
 

celerisk

When Zerg floweth, life is good
Reaction score
62
PHP:
# some random "html" content
$html = '<div>Testing... 1 2 3, <strong>BOLD HERE</strong>, <i>italic</i>. Done!<br />
<dl>
<dt>List test:</dt>
<dd>List item 1
<dd>List item 2</dd>
<dd><a href="http://www.thehelper.net/">Oh, a link. Click</a></dd>
<dd>Last item</dd>
</dl>';

# ask someone who knows what he is doing
$document = new DOMDocument();
$document->loadHTML($html);

# if you feel like seeing what it loaded as
# print_r($document->saveHTML());

# "extract" text version
$text = $document->getElementsByTagName('body')->item(0)->nodeValue;

# do whatever
echo "<pre>$text</pre>\n";
 
General chit-chat
Help Users
  • No one is chatting at the moment.
  • Varine Varine:
    How can you tell the difference between real traffic and indexing or AI generation bots?
  • The Helper The Helper:
    The bots will show up as users online in the forum software but they do not show up in my stats tracking. I am sure there are bots in the stats but the way alot of the bots treat the site do not show up on the stats
  • Varine Varine:
    I want to build a filtration system for my 3d printer, and that shit is so much more complicated than I thought it would be
  • Varine Varine:
    Apparently ABS emits styrene particulates which can be like .2 micrometers, which idk if the VOC detectors I have can even catch that
  • Varine Varine:
    Anyway I need to get some of those sensors and two air pressure sensors installed before an after the filters, which I need to figure out how to calculate the necessary pressure for and I have yet to find anything that tells me how to actually do that, just the cfm ratings
  • Varine Varine:
    And then I have to set up an arduino board to read those sensors, which I also don't know very much about but I have a whole bunch of crash course things for that
  • Varine Varine:
    These sensors are also a lot more than I thought they would be. Like 5 to 10 each, idk why but I assumed they would be like 2 dollars
  • Varine Varine:
    Another issue I'm learning is that a lot of the air quality sensors don't work at very high ambient temperatures. I'm planning on heating this enclosure to like 60C or so, and that's the upper limit of their functionality
  • Varine Varine:
    Although I don't know if I need to actually actively heat it or just let the plate and hotend bring the ambient temp to whatever it will, but even then I need to figure out an exfiltration for hot air. I think I kind of know what to do but it's still fucking confusing
  • The Helper The Helper:
    Maybe you could find some of that information from AC tech - like how they detect freon and such
  • Varine Varine:
    That's mostly what I've been looking at
  • Varine Varine:
    I don't think I'm dealing with quite the same pressures though, at the very least its a significantly smaller system. For the time being I'm just going to put together a quick scrubby box though and hope it works good enough to not make my house toxic
  • Varine Varine:
    I mean I don't use this enough to pose any significant danger I don't think, but I would still rather not be throwing styrene all over the air
  • The Helper The Helper:
    New dessert added to recipes Southern Pecan Praline Cake https://www.thehelper.net/threads/recipe-southern-pecan-praline-cake.193555/
  • The Helper The Helper:
    Another bot invasion 493 members online most of them bots that do not show up on stats
  • Varine Varine:
    I'm looking at a solid 378 guests, but 3 members. Of which two are me and VSNES. The third is unlisted, which makes me think its a ghost.
    +1
  • The Helper The Helper:
    Some members choose invisibility mode
    +1
  • The Helper The Helper:
    I bitch about Xenforo sometimes but it really is full featured you just have to really know what you are doing to get the most out of it.
  • The Helper The Helper:
    It is just not easy to fix styles and customize but it definitely can be done
  • The Helper The Helper:
    I do know this - xenforo dropped the ball by not keeping the vbulletin reputation comments as a feature. The loss of the Reputation comments data when we switched to Xenforo really was the death knell for the site when it came to all the users that left. I know I missed it so much and I got way less interested in the site when that feature was gone and I run the site.
  • Blackveiled Blackveiled:
    People love rep, lol
    +1
  • The Helper The Helper:
    The recipe today is Sloppy Joe Casserole - one of my faves LOL https://www.thehelper.net/threads/sloppy-joe-casserole-with-manwich.193585/
  • The Helper The Helper:
    Decided to put up a healthier type recipe to mix it up - Honey Garlic Shrimp Stir-Fry https://www.thehelper.net/threads/recipe-honey-garlic-shrimp-stir-fry.193595/

      The Helper Discord

      Members online

      No members online now.

      Affiliates

      Hive Workshop NUON Dome World Editor Tutorials

      Network Sponsors

      Apex Steel Pipe - Buys and sells Steel Pipe.
      Top