Manage web files with FileWizard

  Manage Web files with FileWizard

Whether your website is hosted by a web hosting company or on your own server, you can use FileWizard to manage it

Questions that FileWizard can Answer

 The Knozall website is hosted by a web hosting company that provides email, storage, and a shopping cart.

 The examples shown here illustrate some of the things that are considered in managing the website.

 If you host your own website the task will be very similar.

Three Easy Steps

1.      Define the website as a Volume

2.      Scan the Volume

3.      Report Results

Setup FileWizard to see the Website as a Volume

Defining the website as a volume is easy using FileWizard’s UserVolume capability.  You can define almost anything as a user volume.

 The next few figures will show you how to define your website as a user volume.

 

 

Click image to expand

Figure 1 FileWizard Query Screen

 In the Standard query screen, click on the Select Volume icon . This brings up the FileWizard Query Screen that shows the volumes associated with mapped drives on your workstation. The two check boxes at the bottom of the screen let you choose to display "Logical Volumes" and "User Volumes".  Logical volumes are drives you have mapped and User Volumes are volumes you have defined to FileWizard to analyze. If you have not defined any user volumes, when  you uncheck the Logical volumes button, there will be no User volumes to display.

 Click image to expand

Figure 2 Select Volume Screen

 

To add your website as a UserVolume, click on “User Volumes” in the upper left portion of the window and then select “Add Volume”.

 

 

Click image to expand

Figure 3 Add Volume

 

The Add Volume screen lets you define the parameters of the volume you wish to add.

 

 

Click image to expand

Figure 4 Add Volume Screen

 

In this window do the following:   Click on FTP site

                     Deselect Use default, and click on Save Password

                 Type in the Username for the site to be scanned.

                     Type in the Password for the site to be scanned.

                     Check Passive Transfers box

                     Click “OK” to save this user defined volume.

Click image to expand

Figure 5 Add Volume Specifications

 

The site www.knozall.com has been defined as an ftp site, and can now be selected and scanned.  Any of the specifications can be changed by highlighting the volume and choosing "Edit Volume" in the Select Volume screen.

 

 

Click image to expand

Figure 6 Select Volume - displaying  User volumes

Select and Scan the Volume

Highlight the volume and click “Scan Now” to scan the volume.  Scanning the volume creates a FileWizard database that is used to provide results as various queries are run.

 

 

Click image to expand

Figure 7 Select ftp.www.knozall.com as a volume to scan

Create a Webfiles Query

Create a query by selecting File Name, File Size, File Type and File Modify Date.  Drag and drop them from the left pane to the right pane of the query window.  Note that the “traffic light” has turned green, indicating that enough items have been selected to make a valid query. Make sure that the subfolders icon is selected so that all subfolder     contents will be reported.

Click image to expand

Figure 8 Create a query

Report Results

What files are on the website?

FileWizard Results screens and reports are produced by creating queries that will gather information from the database created by the scan.

 

 

Click image to expand

Figure 9 Webfiles Results Window  (sorted by Size)

 

The results are shown in tabular form.  From the bottom of this window we see that there are 11,020 objects found which total 351MB in size, and there is 706 days between the earliest and latest create dates.

From this report, the data may be sorted by any of the columns in either ascending or decending order by clicking on the column title. 

The left pane of the window shows the directories in the web site, so by highlighting a directory, just the files in that directory and its sub directories would be shown.

Find which are the 10 largest files on the web

Now let’s modify the query to see which 10 files are the top 10 space users.

This is done by clicking on the Result Threshold icon.

The Result Threshold window allows you to set the number of items you wish to have displayed (either files, folders or both).  You can set this number to any value you wish.  We chose 10 because it produces graphical results that are easy to use.

 

Click image to expand

Figure 10 Result Threshold Screen

 

Note that 11 files are shown, rather than 10.  This is due to the fact that the file name of file 10 and file 11 are identical.

 

Click image to expand

Figure 11 Largest files on web query

The tabular report can be changed to graphical presentation by selecting either the pie chart  or bar graph  icons.

 

 

 

Click image to expand

Figure 12 The 10 largest files on the website (Bar Chart)

 

Or the data can be displayed as a pie chart.

 

Click image to expand

Figure 12 Chart of Largest files on web

Find the space useage by file type

Create a listing by type.  All that is needed for query items is the File Size and File Type.   The Result Threshold window is set to select the File Types that are the top ten space users. (This could be changed to any number you prefer).   As we might expect there is 124.5 MB of zip files, and 44.5 MB of pdf files.

 

 

 

Click image to expand

Figure 13 Top 10 web file types

 

As we may have anticipated, the large values for pdf and exe files is a result of having these files available for download on the ftp site. 

 

Click image to expand

Figure 14 Graph of Top Ten file types

What is in the download directory?

Download directory results are displayed. There are 45 files, with a total size of 33.4MB.

 

 

 

Click image to expand

Figure 15 Files in the download directory

 

Choose graph to show the top file types in the download directory.

 

 

Click image to expand

Figure 16 Top Types in the download Site

What files are in the browsable web site itself?

Checking out the files in the portion of the web site that is browsable is done by highlighting the directory www-knozall.

There are 9986 files, using 125MB of space in the web site.  The 0 Days indicates that all files were modified on the same date.

Click image to expand

Figure 17 Files in Web

 

The age of web files can be determined by looking at modify dates versus time. This data shows that about 133Mb were last modified in the quarter ending May 6, 2007, and 152 MB were modified in the quarter ending  April 30,2008. 

 

Click image to expand

Figure 18 Modify Dates over time

Who is using space in mailboxes?

By highlighting the knozall.com directory, the space used by mailboxes is shown.  The results are shown sorted by mailbox size.

 

 

 

Click image to expand

Figure 19 Tabular Listing of  Mailboxes

Produce Reports Automatically to track your website

Using FileWizard’s scheduling capability, you can create a script to scan, query, and even copy the contents of your web site to a local drive for backup purposes.

The starting point is in the FileWizard Schedule Window.

Click image to expand

Figure20 FileWizard Schedule Window

Building a schedule

Drag and drop the scan icon into the right half pane.  Double clicking will present the Select Volume window.  Highlight the user volume that is the website you wish to manage.

Click image to expand

Figure 21 Select volume to scan

Next drag and drop the query icon into the right half pane. Double clicking on the query icon presents a list of the saved queries.  For this example, we will choose “top web types” as our query. 

 

Click image to expand

Figure 22 Selecting the Query

 

Then click on “Set Initial Folder”, and in the Initial Folder window, select “From Query File” and press OK in both windows.  You now have a schedule window with  Scan and Query actions defined.

 

Click image to expand

Figure 23 Setting the Initial Folder (where the query is to start)

 Next drag and drop the Print icon to the left half pane.

 

Click image to expand

Figure 24 Setting the source for the Print command

 

Double clicking on the Print icon brings up the window asking for the source for the information to be printed.  Click on the “From Query” button to see a list of queries that have been run in this schedule and select the one of your choice.

 

 

Click image to expand

Figure 25 Print command source specified

After selecting the source, a standard print definition window allows selection of the printer.

Add the “Largest files on web” query and print command. This is done by dragging the query icon into the right hand pane, double clicking on the query icon and choosing the query “Largest files on web” from the list.  The print command is added as shown earlier.

You can add as many query and print commands as you wish in the schedule.

Scheduling the Script

Click on the schedule icon  to set the time of operation of the script.

 

 

Click image to expand

Figure 26 Schedule window

Use the SetActive icon  to set the script to active status.

 

 

Click image to expand

Figure 27  Schedule Run Status

The schedule run window indicates the start date and time, the interval the script is to be run, and whether it is set active or inactive.

 Add to the schedule all the scans, queries and reports you wish to run on a periodic basis.  Use the regularly produced output to manage your website.

©2008 Knozall Software, Inc.,  (800) 333 8698   (520) 426-3859
9386 N. Linnet Road, Casa Grande, Arizona 85294