- Jun 13, 2005
Jessica Severin authored
have not been run before (< retry_count)
Jessica Severin authored
don't have any ensembl-hive extensions. To get access to extended hive functionality one must inherit from Hive::Process. Multiple inheritance is allowed, like:

    our @ISA = qw( Bio::EnsEMBL::Hive::Process Bio::EnsEMBL::Pipeline::RunnableDB );
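A minimal sketch of such a hybrid module, assuming a hypothetical module name (only the @ISA line comes from the message):

    package MyHybridAnalysis;   # hypothetical module name

    use strict;
    use warnings;

    use Bio::EnsEMBL::Hive::Process;
    use Bio::EnsEMBL::Pipeline::RunnableDB;

    # Hive::Process comes first so its extended methods win in method lookup
    our @ISA = qw( Bio::EnsEMBL::Hive::Process Bio::EnsEMBL::Pipeline::RunnableDB );

    sub fetch_input  { my $self = shift; return 1; }
    sub run          { my $self = shift; return 1; }
    sub write_output { my $self = shift; return 1; }

    1;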
Jessica Severin authored
into the Worker object (and out of the Process)
- added Process::worker method so that running processes can talk to the worker that is currently running them
- modified the system so that if a Process subclass uses Process::dataflow_output_id on branch_code 1, the automatic flowing of the input_job out on branch_code 1 is turned off. This makes coding much cleaner, since processes no longer need to modify the input_id of the input_job
- added method Process::autoflow_inputjob, which toggles this autoflow behaviour if a subclass would like to modify it directly
- auto_dataflow now happens right after the Process::write_output stage
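A hedged sketch of the new behaviour inside a hypothetical Process subclass (the input_id string is illustrative):

    sub write_output {
        my $self = shift;

        # explicitly flowing on branch_code 1 turns off the automatic
        # flowing of the input_job on branch 1 for this job
        $self->dataflow_output_id("{'member_id'=>42}", 1);

        return 1;
    }

    # or toggle the behaviour directly from a subclass:
    # $self->autoflow_inputjob(0);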
- May 26, 2005
Jessica Severin authored
- Apr 18, 2005
Jessica Severin authored
thus allowing one to do hybrid URL/Registry code like:

    if($url) {
        $dbc    = Bio::EnsEMBL::Hive::URLFactory->fetch($url, 'compara')->dbc;
        $dbname = $dbc->dbname();
    } else {
        $dbc = Bio::EnsEMBL::Registry->get_DBAdaptor($dbname, 'compara')->dbc;
    }
    $fa = Bio::EnsEMBL::Registry->get_adaptor($dbname, 'compara', 'Family');
- Mar 22, 2005
Jessica Severin authored
which is a simple paired file like:

    ia64e:3306 127.0.0.1:3371

If the file is missing it does nothing.
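A minimal sketch of reading such a paired file, assuming one "from to" pair per line (file name and parsing are assumptions):

    use strict;
    use warnings;

    my %map;
    if (open(my $fh, '<', 'hive_host_map.conf')) {   # if the file is missing, do nothing
        while (my $line = <$fh>) {
            my ($from, $to) = split(' ', $line);
            $map{$from} = $to if defined $to;        # e.g. maps ia64e:3306 to 127.0.0.1:3371
        }
        close($fh);
    }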
- Mar 11, 2005
Jessica Severin authored
Jessica Severin authored
-lsf_options <string>, which passes whatever is specified in the string straight through to the bsub command, thus allowing the user complete flexibility
- Mar 08, 2005
Jessica Severin authored
program has evolved into being the primary portal for user interaction with the Hive, and 95% of its functions are compute-resource agnostic, so it makes more sense to have internal switches for which compute resource to submit/check. Expanded the local logic to allow multiple local CPUs: submit into the background (fork), check with 'ps', kill with 'kill -9'.
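A rough sketch of the local-mode mechanics described above (the worker script name is an assumption, not the actual code):

    use strict;
    use warnings;

    # submit into background: fork, then exec the worker in the child
    my $pid = fork();
    die "fork failed: $!" unless defined $pid;
    if ($pid == 0) {
        exec('perl', 'runWorker.pl') or die "exec failed: $!";
    }

    # check with 'ps': non-empty output means the worker is still alive
    my $alive = `ps -p $pid -o pid=`;

    # kill with 'kill -9' if it must be stopped
    kill(9, $pid) if $alive;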
Jessica Severin authored
- Mar 04, 2005
Jessica Severin authored
Jessica Severin authored
added columns hive_id and retry. Allows the user to join to failed workers in the hive table, and to see which retry level the job was at when the STDOUT/STDERR files were generated. Set at the beginning of the job run; entries for 'empty' files are deleted at job end.
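A hedged sketch of the kind of join this enables; the table and column names here are assumptions based on the message, not taken from the schema:

    use strict;
    use warnings;
    use DBI;

    my $dbh = DBI->connect('dbi:mysql:hive_db', 'user', 'pass',
                           { RaiseError => 1 });

    # which retry level was each output file generated at, and by which worker?
    my $rows = $dbh->selectall_arrayref(
        'SELECT f.analysis_job_id, f.retry, h.hive_id
           FROM analysis_job_file f
           JOIN hive h USING (hive_id)');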
Jessica Severin authored
in a more filesystem-friendly manner (creates a 256-way hash which distributes the directories evenly and reduces concurrent directory modification). Also reordered how the job output files are saved (done at the beginning right after redirection starts, and at the end right before it's closed).
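A minimal sketch of a 256-way hashed layout, assuming the hash is taken over the numeric id (names are illustrative):

    use strict;
    use warnings;
    use File::Path qw(mkpath);

    sub hashed_dir {
        my ($base_dir, $id) = @_;
        my $bucket = sprintf('%03d', $id % 256);   # 256 evenly-used buckets
        return "$base_dir/$bucket/hive_id_$id";
    }

    mkpath( hashed_dir('/hive/output', 1234) );    # /hive/output/210/hive_id_1234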
Jessica Severin authored
any problems related to setting undef or '0' values.
Jessica Severin authored
made the worse (Tim Cutts). This will do until we figure this out.... I like the '>/dev/null + rerun failed jobs manually with debug' option personally :)
Jessica Severin authored
- Mar 03, 2005
Jessica Severin authored
each digit becomes a directory, with a final directory created with the full hive_id:

    hive_id=1234 => <base_dir>/1/2/3/4/hive_id_1234/
    hive_id=12   => <base_dir>/1/2/hive_id_12/

this should distribute the output directories
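A minimal sketch of this digit-per-level scheme (the function name is illustrative):

    use strict;
    use warnings;

    sub hive_output_dir {
        my ($base_dir, $hive_id) = @_;
        my $digits = join('/', split(//, "$hive_id"));   # 1234 -> "1/2/3/4"
        return "$base_dir/$digits/hive_id_$hive_id/";
    }

    print hive_output_dir('/hive/out', 1234);   # /hive/out/1/2/3/4/hive_id_1234/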
Jessica Severin authored
is calculated. If batch_size > 0, use batch_size; else use the avg_msec_per_job equation.
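A hedged sketch of that selection logic; the fixed wall-clock budget per batch is an assumption, not the actual constant:

    sub effective_batch_size {
        my ($batch_size, $avg_msec_per_job) = @_;

        return $batch_size if $batch_size > 0;      # explicit value wins

        # otherwise size the batch so it runs for roughly a fixed time
        my $target_msec = 2 * 60 * 1000;            # assumed batch budget
        my $n = int($target_msec / ($avg_msec_per_job || 1));
        return $n > 0 ? $n : 1;
    }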
Jessica Severin authored
to RunnableDB to allow full benefit of dataflow graph capabilities.
- Removed from Extension.pm the branch_code, analysis_job_id, reset_job extensions to RunnableDB (no longer trying to shoe-horn hive 'extra' functions into them)
- Bio::EnsEMBL::Hive::Process mirrors some of the RunnableDB interface (new, analysis, fetch_input, run, write_output) but uses a new job interface (input_job, dataflow_output_id) instead of input_id (it provides a convenience method $self->input_id which redirects to $self->input_job->input_id to simplify porting)
- Changed Worker to only use hive 'extended' functions if the processing module isa(Bio::EnsEMBL::Hive::Process). This also allows all RunnableDB modules to still be used (or any object which implements a minimal 'RunnableDB interface': new, input_id, db, fetch_input, run, write_output)
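A minimal sketch of a module written against the new Process interface (the module name and branch number are hypothetical; the method names come from the message):

    package MyHiveProcess;   # hypothetical

    use strict;
    use warnings;
    use base ('Bio::EnsEMBL::Hive::Process');

    sub fetch_input {
        my $self = shift;
        # convenience accessor: redirects to $self->input_job->input_id
        my $input_id = $self->input_id;
        return 1;
    }

    sub run { my $self = shift; return 1; }

    sub write_output {
        my $self = shift;
        $self->dataflow_output_id($self->input_id, 2);   # flow on branch 2
        return 1;
    }

    1;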
Jessica Severin authored
reordered where the blocking checks are done (added, deleted, moved).
Jessica Severin authored
Jessica Severin authored
Jessica Severin authored
needed workers after this worker is done. Useful in debugging one's dataflow and blocking_ctrl graphs by running one worker at a time (like stepping in a debugger)
Jessica Severin authored
- Mar 02, 2005
Jessica Severin authored
a job that has been flowed into an analysis/process
Jessica Severin authored
Jessica Severin authored
- Feb 23, 2005
Jessica Severin authored
Jessica Severin authored
added option -no_pend, which ignores the pending_count when figuring out how many workers to submit; removed some superfluous calls to Queen::get_num_running_workers
Jessica Severin authored
Jessica Severin authored
when debugging an analysis which fails and would increment the retry_count.
Jessica Severin authored
Jessica Severin authored
Jessica Severin authored
Jessica Severin authored
to be promoted to 'DONE'
- Feb 22, 2005
Jessica Severin authored
- Feb 21, 2005
Jessica Severin authored
needed to better manage the hive system's load on the database housing all the hive-related tables (in case the database is overloaded by multiple users).
- Added analysis_stats.sync_lock column (and correspondingly in the Object and Adaptor)
- Added Queen::safe_synchronize_AnalysisStats method, which wraps the synchronize_AnalysisStats method and does various checks and locks to ensure that only one worker is trying to do a 'synchronize' on a given analysis at any given moment
- Cleaned up the API between Queen/Worker so that the worker only talks directly to the Queen, rather than getting the underlying database adaptor
- Added analysis_job columns runtime_msec and query_count to provide more data on how the jobs hammer a database (queries/sec)
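One plausible shape for such a sync_lock, sketched with DBI: an atomic UPDATE claims the lock, and zero affected rows means another worker holds it. The SQL and surrounding code are assumptions based on the column described above, not the actual Queen code:

    use strict;
    use warnings;
    use DBI;

    my $dbh = DBI->connect('dbi:mysql:hive_db', 'user', 'pass',
                           { RaiseError => 1 });
    my $analysis_id = 42;   # hypothetical

    # atomically claim the lock: 0 rows affected means another
    # worker is already synchronizing this analysis
    my $claimed = $dbh->do(
        'UPDATE analysis_stats SET sync_lock=1
          WHERE analysis_id=? AND sync_lock=0',
        undef, $analysis_id);

    if ($claimed == 1) {
        # ... safe to run the synchronize here, then release ...
        $dbh->do('UPDATE analysis_stats SET sync_lock=0 WHERE analysis_id=?',
                 undef, $analysis_id);
    }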
- Feb 17, 2005
Jessica Severin authored
called when a worker dies, to replace itself in the needed_workers count, since that count is decremented when a worker is born and the worker is counted as living (and subtracted) as long as it's running. This guarantees that another worker will quickly be created after this one dies (and won't need to wait for a synch to happen)
Jessica Severin authored
- Feb 16, 2005
Jessica Severin authored
is when there are lots of workers 'WORKING', so as to avoid them falling over each other. The 'WORKING' state only exists in the middle of a large run. When the last worker dies the state is 'ALL_CLAIMED', so the sync-on-death will happen properly. As the last pile of workers die they will all do a synch, but that's OK, since the system needs to be properly synched when the last one dies because there won't be anybody left to do it. Also added a 10-minute check for whether an analysis is already 'SYNCHING', to deal with the case where a worker dies in the middle of 'SYNCHING'.