Diff - a2f0db2be27385211f033271d8b83e9caf362236^! - quagga

commit	a2f0db2be27385211f033271d8b83e9caf362236	[log] [tgz]
author	Paul Jakma <paul.jakma@hpe.com>	Thu Feb 25 16:41:56 2016 +0000
committer	Paul Jakma <paul@quagga.net>	Mon Jan 23 18:51:59 2017 +0000
tree	15de9347d92427dd99d7566ccfcf7a787b12dc61
parent	366bb4ab851137e669a2e7db7a45d73b39090249 [diff] [blame]

lib: track worst case # of cycles and don't allow granularity to go above

* The workqueue code at present errs towards optimising the granularity
  for throughput of queue items in runs.  This perhaps is at the cost
  of risking excessive delays at times.  Make the workqueue take
  worst-cases into account.

* thread.c: (thread_should_yield) When thread should yield, we can
  return the time taken for free, as it might be useful to caller.
  work_queue_run

* workqueue.h: (struct work_queue) Add fields for worst # of cycles,
  and (independently) worst time taken.

* workqueue.c: (work_queue_new) Worst starts high.

  (work_queue_run) Track the worst number of cycles taken, where a
  queue run had to yield before clearing out the queue.  Use this as an
  upper-bound on the granularity, so the granulity can never increase.

  Track the worst-case delay per work-queue, where it had to yield, thanks
  to the thread_should_yield return value change.  Note that "show thread
  cpu" already shows stats for the work_queue_run function, inc average and
  worst cases.

Deficiencies:

- A spurious outside delay (e.g.  process not run in ages) could cause
  'worst' to be very low in some particular invocation of a process,
  and it will stay that way for life of process.

- The whole thing of trying to calculate suitable granularities is just
  fragile and impossible to get 100% right.

diff --git a/lib/thread.c b/lib/thread.c
index b65078c..de4d76d 100644
--- a/lib/thread.c
+++ b/lib/thread.c

@@ -1264,8 +1264,8 @@
 thread_should_yield (struct thread *thread)
 {
   quagga_get_relative (NULL);
-  return (timeval_elapsed(relative_time, thread->real) >
-  	  THREAD_YIELD_TIME_SLOT);
+  unsigned long t = timeval_elapsed(relative_time, thread->real);
+  return ((t > THREAD_YIELD_TIME_SLOT) ? t : 0);
 }
 
 void