arunk786

cs.xml

Sep 18th, 2011
<?xml version="1.0"?>

<!-- This is the configuration file for the resource manager in Hadoop. -->
<!-- You can configure various scheduling parameters related to queues. -->
<!-- The properties for a queue follow a naming convention, such as -->
<!-- mapred.capacity-scheduler.queue.<queue-name>.property-name. -->

<configuration>

  <property>
    <name>mapred.capacity-scheduler.maximum-system-jobs</name>
    <value>3000</value>
    <description>Maximum number of jobs in the system that the
      CapacityScheduler can initialize concurrently.
    </description>
  </property>

  <!-- New Config -->
  <property>
    <name>mapred.capacity-scheduler.queue.myqueue1.capacity</name>
    <value>25</value>
  </property>
  <property>
    <name>mapred.capacity-scheduler.queue.myqueue1.maximum-capacity</name>
    <value>50</value>
  </property>
  <property>
    <name>mapred.capacity-scheduler.queue.myqueue2.capacity</name>
    <value>75</value>
  </property>
  <property>
    <name>mapred.capacity-scheduler.queue.myqueue2.maximum-capacity</name>
    <value>80</value>
  </property>
<!--
  <property>
    <name>mapred.capacity-scheduler.queue.default.capacity</name>
    <value>100</value>
    <description>Percentage of the number of slots in the cluster that are
      to be available for jobs in this queue.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.default.maximum-capacity</name>
    <value>-1</value>
    <description>
      maximum-capacity defines a limit beyond which a queue cannot use the
      capacity of the cluster. This provides a means to limit how much excess
      capacity a queue can use. By default, there is no limit.
      The maximum-capacity of a queue can only be greater than or equal to
      its minimum capacity. The default value of -1 implies that a queue can
      use the complete capacity of the cluster.

      This property can be used to keep long-running jobs from occupying
      more than a certain percentage of the cluster, which, in the absence
      of pre-emption, could affect the capacity guarantees of other queues.

      One important thing to note is that maximum-capacity is a percentage,
      so the maximum capacity in absolute terms changes with the cluster's
      capacity. If a large number of nodes or racks are added to the
      cluster, the maximum capacity in absolute terms increases accordingly.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.default.supports-priority</name>
    <value>false</value>
    <description>If true, priorities of jobs will be taken into
      account in scheduling decisions.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.default.minimum-user-limit-percent</name>
    <value>100</value>
    <description>Each queue enforces a limit on the percentage of resources
      allocated to a user at any given time, if there is competition for them.
      This user limit can vary between a minimum and a maximum value. The
      minimum is set to this property value, and the maximum depends on the
      number of users who have submitted jobs. For example, suppose the value
      of this property is 25. If two users have submitted jobs to a queue, no
      single user can use more than 50% of the queue resources. If a third
      user submits a job, no single user can use more than 33% of the queue
      resources. With 4 or more users, no user can use more than 25% of the
      queue's resources. A value of 100 implies no user limits are imposed.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.default.user-limit-factor</name>
    <value>1</value>
    <description>A multiple of the queue capacity that can be configured to
      allow a single user to acquire more slots.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.default.maximum-initialized-active-tasks</name>
    <value>200000</value>
    <description>The maximum number of tasks, across all jobs in the queue,
      which can be initialized concurrently. Once the queue's jobs exceed this
      limit, they will be queued on disk.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.default.maximum-initialized-active-tasks-per-user</name>
    <value>100000</value>
    <description>The maximum number of tasks per user, across all of the
      user's jobs in the queue, which can be initialized concurrently. Once
      the user's jobs exceed this limit, they will be queued on disk.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.default.init-accept-jobs-factor</name>
    <value>10</value>
    <description>The multiple of (maximum-system-jobs * queue-capacity) used
      to determine the number of jobs which are accepted by the scheduler.
    </description>
  </property>
-->

  <!-- The default configuration settings for the capacity task scheduler. -->
  <!-- The default values are applied to any queue that does not set the -->
  <!-- corresponding queue-specific property. -->
  <property>
    <name>mapred.capacity-scheduler.default-supports-priority</name>
    <value>false</value>
    <description>If true, priorities of jobs will be taken into
      account in scheduling decisions by default in a job queue.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.default-minimum-user-limit-percent</name>
    <value>100</value>
    <description>The default limit, as a percentage of a job queue's
      resources, applied to a single user at any given point in time.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.default-user-limit-factor</name>
    <value>1</value>
    <description>The default multiple of queue-capacity which is used to
      determine the number of slots a single user can consume concurrently.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.default-maximum-active-tasks-per-queue</name>
    <value>200000</value>
    <description>The default maximum number of tasks, across all jobs in the
      queue, which can be initialized concurrently. Once the queue's jobs
      exceed this limit, they will be queued on disk.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.default-maximum-active-tasks-per-user</name>
    <value>100000</value>
    <description>The default maximum number of tasks per user, across all of
      the user's jobs in the queue, which can be initialized concurrently.
      Once the user's jobs exceed this limit, they will be queued on disk.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.default-init-accept-jobs-factor</name>
    <value>10</value>
    <description>The default multiple of (maximum-system-jobs * queue-capacity)
      used to determine the number of jobs which are accepted by the scheduler.
    </description>
  </property>

  <!-- Capacity scheduler job initialization configuration parameters -->
  <property>
    <name>mapred.capacity-scheduler.init-poll-interval</name>
    <value>5000</value>
    <description>The interval, in milliseconds, at which the job queues are
      polled for jobs to initialize.
    </description>
  </property>
  <property>
    <name>mapred.capacity-scheduler.init-worker-threads</name>
    <value>5</value>
    <description>Number of worker threads used by the initialization poller
      to initialize jobs across the job queues. If this number equals the
      number of job queues, each queue gets its own thread. If it is smaller,
      each thread is assigned a set of queues. If it is larger, the number of
      threads is capped at the number of job queues.
    </description>
  </property>

</configuration>
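
The header comment of this file describes a per-queue naming convention, and the commented-out block shows the property suffixes for the default queue. As a rough sketch of how that convention might be applied to one of the new queues, the fragment below overrides two of the scheduler-wide defaults for myqueue1. The values 25 and true are illustrative assumptions and are not part of the original paste; the suffixes are copied from the default-queue block, and the fragment is meant to sit inside the existing configuration element.

```xml
<!-- Hypothetical per-queue overrides for myqueue1 (values are assumptions). -->
<!-- Place these inside the existing <configuration> element. -->

<property>
  <name>mapred.capacity-scheduler.queue.myqueue1.minimum-user-limit-percent</name>
  <!-- Going by the description in this file: with a value of 25, two -->
  <!-- competing users are each limited to 50% of the queue, three users -->
  <!-- to 33% each, and four or more users to 25% each. -->
  <value>25</value>
</property>

<property>
  <name>mapred.capacity-scheduler.queue.myqueue1.supports-priority</name>
  <!-- Overrides default-supports-priority (false) for this queue only. -->
  <value>true</value>
</property>
```

A queue that sets neither property keeps the values from default-minimum-user-limit-percent and default-supports-priority, so myqueue2 would be unaffected by this fragment.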