<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/css" href="/stylesheets/rss.css"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:trackback="http://madskills.com/public/xml/rss/module/trackback/">
  <channel>
    <title>Inspire Creativity: The Importance of Analyzing your Logs</title>
    <link>http://kevinwatt.org/articles/2005/06/06/the-importance-of-analyzing-your-logs</link>
    <language>en-us</language>
    <ttl>40</ttl>
    <description></description>
    <item>
      <title>The Importance of Analyzing your Logs</title>
      <description>&lt;p&gt;
You might have noticed that I run a big poetry website called allpoetry.com.  For the first few years I ran a nightly analysis of the logfiles, then an hourly one as the site got bigger, but it kept slowing things down.
&lt;/p&gt;&lt;p&gt;
And I never looked at it.  So I disabled it, and I just clean up the log by hand every few months once it reaches a few gigabytes.
&lt;/p&gt;&lt;p&gt;
The important thing with statistics is figuring out which questions you want answered.  To me that&amp;#8217;s one of the big problems with research sometimes &amp;#8211; rummaging through answers to questions we don&amp;#8217;t have, and might never have.
&lt;/p&gt;&lt;p&gt;
Anyways, I&amp;#8217;m playing around with me-driven logfile analysis of the last 100,000 rows (just using &amp;#8216;tail&amp;#8217;) whenever I want to look at the stats.  It&amp;#8217;s not quite as useful as historical stats, but it&amp;#8217;s faster.
&lt;/p&gt;&lt;p&gt;
Another big problem was that, since it&amp;#8217;s a dynamic site, many pages are unique.  I really want a tool that will &amp;#8216;stem&amp;#8217; my pages, so /poem/14441?reply=yes will just be /poem.  It&amp;#8217;s more useful to see that there were 20,000 hits on poems than 20 hits on poem number 14441.
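&lt;/p&gt;&lt;p&gt;
As a rough idea of what such a tool could look like, here is a minimal Python sketch. It assumes Apache-style access log lines in which the request appears as "GET /poem/14441?reply=yes HTTP/1.1", and it treats the first path segment as the stem. The names stem_path, count_stemmed_hits and REQUEST_RE, along with the sample lines, are made up for illustration and are not taken from any existing tool.
&lt;/p&gt;&lt;pre&gt;
```python
import re
from collections import Counter

# Assumption: the request is logged as "METHOD /path HTTP/x.y" (Apache-style).
# Adjust this pattern if the log format differs.
REQUEST_RE = re.compile(r'"[A-Z]+ (\S+) HTTP/[0-9.]+"')


def stem_path(url):
    """Reduce a URL to its first path segment, dropping query and fragment."""
    path = url.split("?", 1)[0].split("#", 1)[0]
    segments = [s for s in path.split("/") if s]
    return "/" + segments[0] if segments else "/"


def count_stemmed_hits(lines):
    """Count hits per stemmed path across an iterable of log lines."""
    counts = Counter()
    for line in lines:
        match = REQUEST_RE.search(line)
        if match:
            counts[stem_path(match.group(1))] += 1
    return counts


# Hypothetical sample lines, only here to show the shape of the input.
sample = [
    '1.2.3.4 - - [06/Jun/2005:23:19:17 -0500] "GET /poem/14441?reply=yes HTTP/1.1" 200 512',
    '1.2.3.4 - - [06/Jun/2005:23:19:18 -0500] "GET /poem/14442 HTTP/1.1" 200 498',
    '1.2.3.4 - - [06/Jun/2005:23:19:19 -0500] "GET / HTTP/1.1" 200 1024',
]

# Print hit counts, most frequent stem first.
for path, hits in count_stemmed_hits(sample).most_common():
    print(hits, path)
```
&lt;/pre&gt;&lt;p&gt;
To run it against real data, the sample list could be swapped for sys.stdin and fed with something like tail -n 100000 access.log (the log file name here is hypothetical). Stemming to the first segment is the simplest choice; a site with deeper URLs might want to keep two segments, or drop only the numeric ones.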
&lt;/p&gt;&lt;p&gt;
I&amp;#8217;m sure others have confronted this problem before; if you have any ideas, let me know!
&lt;/p&gt;</description>
      <pubDate>Mon, 06 Jun 2005 23:19:17 -0500</pubDate>
      <guid isPermaLink="false">urn:uuid:035bfd5494d65a6866f7831fa47ecf17</guid>
      <author>kevin</author>
      <link>http://kevinwatt.org/articles/2005/06/06/the-importance-of-analyzing-your-logs</link>
      <category>Coding</category>
    </item>
  </channel>
</rss>
