[MKSearch-dev] http://test.mksearch.mkdoc.org/ down?

Jeff Albro jalbro at bu.edu
Mon Feb 25 21:50:41 GMT 2008


I can confirm that the site is back up... but it is still not working
for me... How sensitive it is to java version?
I'm using:

export mk_build=/home/jalbro/mksearch/build
export mk_home=/home/jalbro/mksearch
#export CLASSPATH=/usr/share/java/libgcj-3.4.1.jar
export CLASSPATH=/usr/share/java/libgcj-3.4.3.jar

And I get the error below.  I also got it with gij-jspider.

But I can wget http://test.mksearch.mkdoc.org/ just fine.

Ideas?

Thanks!

-Jeff

sed-linux:~/mksearch/bin$ $mk_home/bin/java-jspider.sh
http://test.mksearch.mkdoc.org/ triple
@jspider.version.string@
Build: @build.DSTAMP@
Started from .
[Engine] jspider.home=/home/jalbro/mksearch
[Engine] default output folder=/home/jalbro/mksearch/output
[Engine] starting with configuration 'triple'
Loading 2 plugins.
Loading plugin configuration 'console'...
first trying to instantiate via ctr with (name, config) params
plugin 'console' prefix is '[Plugin]'
adding space after prefix
Prefix set to '[Plugin] '
plugin instantiated.
Plugin not configured for local event filtering
Plugin Name    : Console writer JSpider module
Plugin Version : v1.0
Plugin Vendor  : http://www.javacoding.net
Loading plugin configuration 'xhtmltriple'...
first trying to instantiate via ctr with (name, config) params
cannot instantiate module - constructor with name and PropertySet params
not found
java.lang.NoSuchMethodException: <init>
plugin not yet instantiated, trying via ctr with (config) param
Custom application profile com.mkdoc.schema.DublinCoreProfile loaded.
plugin instantiated.
Plugin uses local event filtering
EventDispatcher for Plugin 'XHTML metadata triple writer plugin for
JSpider' configuring...
EventFilter for engine events =
net.javacoding.jspider.mod.eventfilter.AllowNoneEventFilter
EventFilter for monitor events =
net.javacoding.jspider.mod.eventfilter.AllowNoneEventFilter
EventFilter for spider events =
net.javacoding.jspider.mod.eventfilter.AllowAllEventFilter
EventDispatcher EventDispatcher for Plugin 'XHTML metadata triple writer
plugin for JSpider' configured.
Plugin Name    : XHTML metadata triple writer plugin for JSpider
Plugin Version : v0.7
Plugin Vendor  : http://www.mkdoc.com
Loaded 2 plugins.
Global Event Dispatcher configuring...
EventFilter for engine events =
net.javacoding.jspider.mod.eventfilter.AllowAllEventFilter
EventFilter for monitor events =
net.javacoding.jspider.mod.eventfilter.AllowAllEventFilter
EventFilter for spider events =
net.javacoding.jspider.mod.eventfilter.AllowAllEventFilter
EventDispatcher Global Event Dispatcher configured.
Global Event Dispatcher intializing...
EventDispatcher for Plugin 'XHTML metadata triple writer plugin for
JSpider' intializing...
EventDispatcher for Plugin 'XHTML metadata triple writer plugin for
JSpider' intialized.
Global Event Dispatcher intialized.
Storage provider class is 'class
net.javacoding.jspider.core.storage.memory.InMemoryStorageProvider'
rule net.javacoding.jspider.mod.rule.OnlyHttpProtocolRule hasn't got a
config-param constructor
added rule net.javacoding.jspider.mod.rule.OnlyHttpProtocolRule to
spider ruleset
rule net.javacoding.jspider.mod.rule.TextHtmlMimeTypeOnlyRule hasn't got
a config-param constructor
added rule net.javacoding.jspider.mod.rule.TextHtmlMimeTypeOnlyRule to
parser ruleset
default user Agent is 'MKSearch 0.1 (http://www.mksearch.mkdoc.org)'
TaskScheduler provider class is 'class
net.javacoding.jspider.core.task.impl.DefaultSchedulerProvider'
Spider born - threads: spiders: 1, thinkers: 1
Worker thread (Spider 0) born
Worker thread (Thinker 0) born
[Plugin] Module : Console writer JSpider module
[Plugin] Version: v1.0
[Plugin] Vendor : http://www.javacoding.net
[Plugin] Spidering Started, baseURL = http://test.mksearch.mkdoc.org/
using userAgent 'MKSearch 0.1 (http://www.mksearch.mkdoc.org)' for site
'http://test.mksearch.mkdoc.org'
rule net.javacoding.jspider.mod.rule.InternallyReferencedOnlyRule hasn't
got a config-param constructor
added rule net.javacoding.jspider.mod.rule.InternallyReferencedOnlyRule
to spider ruleset
rule net.javacoding.jspider.mod.rule.BaseSiteOnlyRule hasn't got a
config-param constructor
added rule net.javacoding.jspider.mod.rule.BaseSiteOnlyRule to parser
ruleset
[Plugin] site discovered : http://test.mksearch.mkdoc.org
[Plugin] net.javacoding.jspider.api.event.site.RobotsTXTSkippedEvent
RobotsTXTSkippedEvent for site [Site: http://test.mksearch.mkdoc.org -
ROBOTSTXT_SKIPPED *]
[Plugin] resource discovered: http://test.mksearch.mkdoc.org/
Thinker task dispatcher running ...
Spider task dispatcher running ...
Throttle provider class is 'class
net.javacoding.jspider.core.throttle.impl.DistributedLoadThrottleProvider'
throttle interval set to 1000 ms.
exception during spidering
java.lang.ClassCastException: java.util.List
java.util.List
[Plugin] Error event comment: resource http://test.mksearch.mkdoc.org/
couldn't be fetched [200]
[Plugin] 200 - ERROR !!!http://test.mksearch.mkdoc.org/
Thinker task dispatcher dying ...
Spider task dispatcher dying ...
Stopping spider workers...
Worker thread (Spider 0) dying
Stopped spider workers...
Stopping thinker workers...
Worker thread (Thinker 0) dying
Stopped thinker workers...
[Plugin]
SPIDERING SUMMARY :
known urls ............. : 1

   visited urls ........... : 0
     parsed urls ............ : 0
     parse ignored urls ..... : 0
     parse error urls ....... : 0

   not visited urls ....... : 1
     fetching ignored urls .. : 0
     forbidden urls ......... : 0
     fetch error urls ....... : 1

   not yet  visited urls .. : 0
[Plugin] Spidering Stopped
Global Event Dispatcher shutting down.
EventDispatcher for Plugin 'XHTML metadata triple writer plugin for
JSpider' shutting down.
EventDispatcher for Plugin 'XHTML metadata triple writer plugin for
JSpider' shutdown.
Global Event Dispatcher shutdown.
Spidering done!
Elapsed time : 185
sed-linux:~/mksearch/bin$



---------------------------------------------------------
Jeff Albro - Information Technology Manager
Boston University School of Education
jalbro at bu.edu   (617) 358-2966



More information about the MKSearch-dev mailing list