This project is mirrored from https://:*****@github.com/Ensembl/ensembl-hive.git.
  1. 03 Mar, 2005 1 commit
  2. 02 Mar, 2005 1 commit
  3. 23 Feb, 2005 3 commits
  4. 21 Feb, 2005 1 commit
    • YAHRF (Yet Another Hive ReFactor).....chapter 1 · 7675c31c
      Jessica Severin authored
      needed to better manage the hive system's load on the database housing all
      the hive related tables (in case the database is overloaded by multiple users).
      Added analysis_stats.sync_lock column (and correspondingly in Object and Adaptor)
      Added Queen::safe_synchronize_AnalysisStats method which wraps over the
        synchronize_AnalysisStats method and does various checks and locks to ensure
        that only one worker is trying to do a 'synchronize' on a given analysis at
        any given moment.
      Cleaned up API between Queen/Worker so that worker only talks directly to the
        Queen, rather than getting the underlying database adaptor.
      Added analysis_job columns runtime_msec, query_count to provide more data on
        how the jobs hammer a database (queries/sec).
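The locking idea behind Queen::safe_synchronize_AnalysisStats can be sketched as follows. This is a minimal illustration, not the project's Perl implementation: it assumes the lock is claimed with a conditional UPDATE on analysis_stats.sync_lock, uses an in-memory SQLite table as a stand-in for the real hive database, and the names `safe_synchronize` and `make_demo_db` are hypothetical.

```python
import sqlite3


def safe_synchronize(conn, analysis_id, synchronize):
    """Run synchronize(analysis_id) only if this caller wins sync_lock.

    Returns True if the sync ran, False if the lock could not be claimed.
    """
    # Atomic claim: the UPDATE only matches a row while its lock is free.
    cur = conn.execute(
        "UPDATE analysis_stats SET sync_lock = 1 "
        "WHERE analysis_id = ? AND sync_lock = 0",
        (analysis_id,),
    )
    conn.commit()
    if cur.rowcount != 1:
        return False  # lock held by another worker, or no such analysis
    try:
        synchronize(analysis_id)
    finally:
        # Always release the lock, even if the sync raised.
        conn.execute(
            "UPDATE analysis_stats SET sync_lock = 0 WHERE analysis_id = ?",
            (analysis_id,),
        )
        conn.commit()
    return True


def make_demo_db():
    """Build a one-row stand-in for the analysis_stats table (assumed schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE analysis_stats ("
        "analysis_id INTEGER PRIMARY KEY, "
        "sync_lock INTEGER NOT NULL DEFAULT 0)"
    )
    conn.execute("INSERT INTO analysis_stats (analysis_id) VALUES (1)")
    conn.commit()
    return conn
```

A caller that loses the race simply gets False back and can continue with its job instead of piling a second sync onto the database.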
  5. 17 Feb, 2005 1 commit
    • added method AnalysisStatsAdaptor::increment_needed_workers · af273c18
      Jessica Severin authored
      Called when a worker dies so that it replaces itself in the needed_workers
      count: the count is decremented when the worker is born, and the worker is
      counted as living (and subtracted) as long as it is running.  This guarantees
      that another worker will quickly be created after this one dies, without
      waiting for a sync to happen.
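The bookkeeping described in this commit can be illustrated with a small sketch. The class and method names below are hypothetical; only the needed_workers counter and the decrement-on-birth, increment-on-death behaviour come from the commit message.

```python
class AnalysisStats:
    """Toy model of one analysis_stats row (illustrative only)."""

    def __init__(self, needed_workers):
        self.needed_workers = needed_workers
        self.running_workers = 0

    def worker_born(self):
        # A new worker fills one of the open slots.
        self.needed_workers -= 1
        self.running_workers += 1

    def worker_died(self):
        # The dying worker hands its slot back, so a replacement can be
        # started without waiting for the next synchronization.
        self.running_workers -= 1
        self.needed_workers += 1
```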
  6. 16 Feb, 2005 4 commits
  7. 10 Feb, 2005 1 commit
  8. 11 Jan, 2005 1 commit
  9. 08 Jan, 2005 1 commit
  10. 14 Dec, 2004 1 commit
  11. 09 Dec, 2004 1 commit
  12. 25 Nov, 2004 2 commits
  13. 24 Nov, 2004 1 commit
  14. 20 Nov, 2004 1 commit
    • New distributed Queen system. · e3d44c7e
      Jessica Severin authored
      Queen/hive updates its state in an incremental and distributed manner as it
      interacts with the workers over the course of its life.
      When a runWorker.pl script starts and asks a queen to create a worker the queen has
      a list of known analyses which are 'above the surface' where full hive analysis has
      been done and the number of needed workers has been calculated. Full synch requires
      joining data between the analysis, analysis_job, analysis_stats, and hive tables.
      When this reached 10e7 jobs, 10e4 analyses, 10e3 workers a full hard sync took minutes
      and it was clear this bit of the system wasn't scaling and wasn't going to make it
      to the next order of magnitude. This occurred in the compara blastz pipeline between
      mouse and rat.
      Now there are some analyses 'below the surface' that have partial synchronization.
      These analyses have been flagged as having 'x' new jobs (AnalysisJobAdaptor updating
      analysis_stats on job insert).  If no analysis is found to assign to the newly
      created worker, the queen will dip below the surface and start checking
      the analyses with the highest probability of needing the most workers.
      This incremental sync is also done in Queen::get_num_needed_workers.
      When calculating ahead a total worker count, this routine will also dip below
      the surface until the hive reaches its current defined worker saturation.
      A beekeeper is no longer a required component for the system to function.
      If workers can get onto cpus the hive will run.  The beekeeper is now mainly a
      user display program showing the status of the hive.  There is no longer any
      central process doing work and one hive can potentially scale
      beyond 10e9 jobs in graphs of 10e6 analysis nodes and 10e6 running workers.
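The 'dip below the surface' behaviour described above might look roughly like this. It is a sketch under stated assumptions: the `Analysis` record, the `pick_analysis` function and the `full_sync` callback are hypothetical names, and the count of newly inserted jobs is used as a stand-in for "highest probability of needing the most workers", which the commit message does not define precisely.

```python
from dataclasses import dataclass


@dataclass
class Analysis:
    analysis_id: int
    synced: bool         # True means 'above the surface' (fully synchronized)
    needed_workers: int  # only trustworthy when synced is True
    new_jobs: int = 0    # jobs inserted since the last sync


def pick_analysis(analyses, full_sync):
    """Return an analysis that needs a worker, syncing lazily if required."""
    # First choice: analyses above the surface that already need workers.
    for a in analyses:
        if a.synced and a.needed_workers > 0:
            return a
    # Otherwise dip below the surface, most new jobs first (assumed ordering),
    # and stop syncing as soon as one analysis turns out to need workers.
    below = sorted(
        (a for a in analyses if not a.synced),
        key=lambda a: a.new_jobs,
        reverse=True,
    )
    for a in below:
        full_sync(a)
        if a.needed_workers > 0:
            return a
    return None
```

Because the loop stops at the first analysis that needs a worker, the remaining unsynced analyses cost nothing until some later worker asks for them.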
  15. 09 Nov, 2004 2 commits
    • reformatted code (removed all the tabs) · 088529b5
      Jessica Severin authored
    • refactored synchronization logic to allow for worker distributed syncing. · e6fb56d1
      Jessica Severin authored
      The synchronization of the analysis_stats summary statistics was done by
      the beekeeper at the top of its loop.  For graphs with 40,000+ analyses
      this centralized syncing became a bottleneck.  This new system allows
      the Queen attached to each worker process to synchronize its analysis.
      Syncing happens when a worker 'checks in' and when it dies.  The sync on
      'check in' only updates if the stats are >60secs out of date to prevent
      over-syncing.
      The beekeeper still needs to do whole system syncs when a subsection has
      finished and the next section needs to be 'unblocked'.  For homology this
      will happen 2 times in a 16 hour run.
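The staleness rule for check-in syncing can be sketched like this. The 60-second threshold comes from the commit message; the function name, the dictionary layout and the `synchronize` callback are assumptions made for illustration.

```python
import time

SYNC_INTERVAL_SECS = 60  # threshold stated in the commit message


def check_in(stats, synchronize, now=None):
    """Sync the analysis stats on check-in only if they are stale.

    `stats` is a dict with a 'last_sync' Unix timestamp (assumed layout).
    Returns True if a sync was performed.
    """
    now = time.time() if now is None else now
    if now - stats["last_sync"] > SYNC_INTERVAL_SECS:
        synchronize(stats)
        stats["last_sync"] = now
        return True
    return False  # stats are fresh enough; skip to avoid over-syncing
```

Passing `now` explicitly keeps the rule easy to exercise without waiting on the wall clock.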
  16. 20 Oct, 2004 1 commit
  17. 12 Oct, 2004 1 commit
  18. 11 Aug, 2004 2 commits
  19. 06 Aug, 2004 1 commit
  20. 03 Aug, 2004 1 commit
  21. 16 Jul, 2004 2 commits
  22. 15 Jul, 2004 1 commit
  23. 14 Jul, 2004 1 commit
  24. 13 Jul, 2004 3 commits
  25. 09 Jul, 2004 1 commit
    • changed Queen->create_new_worker method to use rearrange formatting. · 54927cb4
      Jessica Severin authored
      Also added functionality so that runWorker can be run without
      specification of an analysis.  The create_new_worker method now will
      query for a 'needed worker' analysis from the AnalysisStats adaptor when
      the analysis_id is undef.  This simplifies the API interface between the
      Queen and the beekeepers.  Now the beekeeper only needs to receive a count
      of workers.  The workers can still be run with explicit analyses for
      testing or situations where one wants to manually control the processing.
      Now one can simply do
      bsub -JW[1-100] runWorker -url mysql://ensadmin:<pass>@ecs2:3361/compara_hive_jess_23
      to create 100 workers, each of which will take on whatever analysis needs to be done.
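The optional-analysis behaviour of create_new_worker can be sketched as below. This is an illustration only: the Python signature, the dictionary of per-analysis counts, the "pick the analysis with the most needed workers" policy and the decrement on assignment are all assumptions, since the commit message only says that a 'needed worker' analysis is queried when analysis_id is undef.

```python
def create_new_worker(needed_workers, analysis_id=None):
    """Assign a worker to an analysis.

    needed_workers maps analysis_id -> number of workers still needed.
    With an explicit analysis_id the worker is bound to it directly
    (testing or manual control); otherwise one is chosen automatically.
    """
    if analysis_id is not None:
        return {"analysis_id": analysis_id}
    candidates = [aid for aid, n in needed_workers.items() if n > 0]
    if not candidates:
        return None  # nothing needs a worker right now
    # Assumed policy: serve the analysis with the largest shortfall first.
    chosen = max(candidates, key=lambda aid: needed_workers[aid])
    needed_workers[chosen] -= 1  # assumed: the new worker fills one slot
    return {"analysis_id": chosen}
```

With this shape, a caller that submits a batch of workers only needs to know how many to start, not which analysis each one should run.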
  26. 17 Jun, 2004 1 commit
  27. 16 Jun, 2004 1 commit
  28. 14 Jun, 2004 2 commits